<220>

<223> Description of Artificial Sequence: replicon 
<220> 

<223> Inventor: Wakita, Takaji 
Inventor: Kato, Takanobu 
Inventor: Date, Tomoko 



<400> 1 



accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300
cgcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa auccuaaacc 360
ucaaagaaaa accaaaagaa acaccaaccg ucgcccaaug auugaacaag auggauugca 420
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 540
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 840
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgaugau accauggcuc ccaucacugc uuaugcccag 1800 
caaacacgag gccuccuggg cgccauagug gugaguauga cggggcguga caggacagaa 1860 
caggccgggg aaguccaaau ccuguccaca gucucucagu ccuuccucgg aacaaccauc 1920 
ucggggguuu uguggacugu uuaccacgga gcuggcaaca agacucuagc cggcuuacgg 1980 
gguccgguca cgcagaugua cucgagugcu gagggggacu ugguaggcug gcccagcccc 2040 
ccugggacca agucuuugga gccgugcaag uguggagccg ucgaccuaua ucuggucacg 2100 
cggaacgcug augucauccc ggcucggaga cgcggggaca agcggggagc auugcucucc 2160 
ccgagaccca uuucgaccuu gaaggggucc ucgggggggc cggugcucug cccuaggggc 2220 
cacgucguug ggcucuuccg agcagcugug ugcucucggg gcguggccaa auccaucgau 2280 
uucauccccg uugagacacu cgacguuguu acaaggucuc ccacuuucag ugacaacagc 2340 
acgccaccgg cugugcccca gaccuaucag gucggguacu ugcaugcucc aacuggcagu 2400 
ggaaagagca ccaagguccc ugucgcguau gccgcccagg gguacaaagu acuagugcuu 2460 
aaccccucgg uagcugccac ccugggguuu ggggcguacc uauccaaggc acauggcauc 2520 
aaucccaaca uuaggacugg agucaggacc gugaugaccg gggaggccau cacguacucc 2580 
acauauggca aauuucucgc cgaugggggc ugcgcuagcg gcgccuauga caucaucaua 2640 
ugcgaugaau gccacgcugu ggaugcuacc uccauucucg gcaucggaac gguccuugau 2700 
caagcagaga cagccggggu cagacuaacu gugcuggcua cggccacacc ccccggguca 2760 
gugacaaccc cccaucccga uauagaagag guaggccucg ggcgggaggg ugagaucccc 2820 
uucuauggga gggcgauucc ccuauccugc aucaagggag ggagacaccu gauuuucugc 2880 
cacucaaaga aaaaguguga cgagcucgcg gcggcccuuc ggggcauggg cuugaaugcc 2940 
guggcauacu auagaggguu ggacgucucc auaauaccag cucagggaga uguggugguc 3000
gucgccaccg acgcccucau gacgggguac acuggagacu uugacuccgu gaucgacugc 3060 
aauguagcgg ucacccaagc ugucgacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cacaagacgc ugucucacgc agucagcgcc gcgggcgcac agguagagga 3180 
agacagggca cuuauaggua uguuuccacu ggugaacgag ccucaggaau guuugacagu 3240 
guagugcuuu gugagugcua cgacgcaggg gcugcguggu acgaucucac accagcggag 3300 
accaccguca ggcuuagagc guauuucaac acgcccggcc uacccgugug ucaagaccau 3360 
cuugaauuuu gggaggcagu uuucaccggc cucacacaca uagacgccca cuuccucucc 3420 
caaacaaagc aagcggggga gaacuucgcg uaccuaguag ccuaccaagc uacggugugc 3480 
gccagagcca aggccccucc cccguccugg gacgccaugu ggaagugccu ggcccgacuc 3540 
aagccuacgc uugcgggccc cacaccucuc cuguaccguu ugggcccuau uaccaaugag 3600 
gucacccuca cacacccugg gacgaaguac aucgccacau gcaugcaagc ugaccuugag 3660 
gucaugacca gcacgugggu ccuagcugga ggaguccugg cagccgucgc cgcauauugc 3720 
cuggcgacug gaugcguuuc caucaucggc cgcuugcacg ucaaccagcg agucgucguu 3780 
gcgccggaua aggagguccu guaugaggcu uuugaugaga uggaggaaug cgccucuagg 3840 
gcggcucuca ucgaagaggg gcagcggaua gccgagaugu ugaaguccaa gauccaaggc 3900 
uugcugcagc aggccucuaa gcaggcccag gacauacaac ccgcuaugca ggcuucaugg 3960 
cccaaagugg aacaauuuug ggccagacac auguggaacu ucauuagcgg cauccaauac 4020 
cucgcaggau ugucaacacu gccagggaac cccgcggugg cuuccaugau ggcauucagu 4080 
gccgcccuca ccaguccguu gucgaccagu accaccaucc uucucaacau caugggaggc 4140 
ugguuagcgu cccagaucgc accacccgcg ggggccaccg gcuuugucgu caguggccug 4200 
gugggggcug ccgugggcag cauaggccug gguaaggugc ugguggacau ccuggcagga 4260 
uauggugcgg gcauuucggg ggcccucguc gcauucaaga ucaugucugg cgagaagccc 4320 
ucuauggaag augucaucaa ucuacugccu gggauccugu cuccgggagc ccugguggug 4380 
ggggucaucu gcgcggccau ucugcgccgc cacgugggac cgggggaggg cgcgguccaa 4440 
uggaugaaca ggcuuauugc cuuugcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gugacggagu cggaugcguc gcagcgugug acccaacuac uuggcucucu uacuauaacc 4560 
agccuacuca gaagacucca caauuggaua acugaggacu gccccauccc augcuccgga 4620 
uccuggcucc gcgacgugug ggacuggguu ugcaccaucu ugacagacuu caaaaauugg 4680 
cugaccucua aauuguuccc caagcugccc ggccuccccu ucaucucuug ucaaaagggg 4740
uacaagggug ugugggccgg cacuggcauc augaccacgc gcugcccuug cggcgccaac 4800 
aucucuggca auguccgccu gggcucuaug aggaucacag ggccuaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugc uacacggagg gccagugcgc gccgaaaccc 4920 
cccacgaacu acaagaccgc caucuggagg guggcggccu cggaguacgc ggaggugacg 4980 
cagcaugggu cguacuccua uguaacagga cugaccacug acaaucugaa aauuccuugc 5040 
caacuaccuu cuccagaguu uuucuccugg guggacggug ugcagaucca uagguuugca 5100 
cccacaccaa agccguuuuu ccgggaugag gucucguucu gcguugggcu uaauuccuau 5160 
gcugucgggu cccagcuucc cugugaaccu gagcccgacg cagacguauu gagguccaug 5220 
cuaacagauc cgccccacau cacggcggag acugcggcgc ggcgcuuggc acggggauca 5280 
ccuccaucug aggcgagcuc cucagugagc cagcuaucag caccgucgcu gcgggccacc 5340 
ugcaccaccc acagcaacac cuaugacgug gacauggucg augccaaccu gcucauggag 5400 
ggcggugugg cucagacaga gccugagucc agggugcccg uucuggacuu ucucgagcca 5460 
auggccgagg aagagagcga ccuugagccc ucaauaccau cggagugcau gcuccccagg 5520 
agcggguuuc cacgggccuu accggcuugg gcacggccug acuacaaccc gccgcucgug 5580 
gaaucgugga ggaggccaga uuaccaaccg cccaccguug cugguugugc ucuccccccc 5640 
cccaagaagg ccccgacgcc ucccccaagg agacgccgga cagugggucu gagcgagagc 5700 
accauaucag aagcccucca gcaacuggcc aucaagaccu uuggccagcc ccccucgagc 5760
ggugaugcag gcucguccac gggggcgggc gccgccgaau ccggcggucc gacguccccu 5820 
ggugagccgg cccccucaga gacagguucc gccuccucua ugcccccccu cgagggggag 5880 
ccuggagauc cggaccugga gucugaucag guagagcuuc aaccuccccc ccaggggggg 5940 
gggguagcuc ccgguucggg cucggggucu uggucuacuu gcuccgagga ggacgauacc 6000 
accgugugcu gcuccauguc auacuccugg accggggcuc uaauaacucc cuguagcccc 6060 
gaagaggaaa aguugccaau caacccuuug aguaacucgc uguugcgaua ccauaacaag 6120 
guguacugua caacaucaaa gagcgccuca cagagggcua aaaagguaac uuuugacagg 6180 
acgcaagugc ucgacgccca uuaugacuca gucuuaaagg acaucaagcu agcggcuucc 6240 
aaggucagcg caaggcuccu caccuuggag gaggcgugcc aguugacucc accccauucu 6300 
gcaagaucca aguauggauu cggggccaag gagguccgca gcuuguccgg gagggccguu 6360 
aaccacauca aguccgugug gaaggaccuc cuggaagacc cacaaacacc aauucccaca 6420 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaagaaacca 6480
gcucgccuca ucguuuaccc ugaccucggc guccgggucu gcgagaaaau ggcccucuau 6540 
gacauuacac aaaagcuucc ucaggcggua augggagcuu ccuauggcuu ccaguacucc 6600 
ccugcccaac ggguggagua ucucuugaaa gcaugggcgg aaaagaagga ccccaugggu 6660 
uuuucguaug auacccgaug cuucgacuca accgucacug agagagacau caggaccgag 6720 
gaguccauau accaggccug cucccugccc gaggaggccc gcacugccau acacucgcug 6780 
acugagagac uuuacguagg agggcccaug uucaacagca agggucaaac cugcgguuac 6840 
agacguugcc gcgccagcgg ggugcuaacc acuagcaugg guaacaccau cacaugcuau 6900 
gugaaagccc uagcggccug caaggcugcg gggauaguug cgcccacaau gcugguaugc 6960 
ggcgaugacc uaguagucau cucagaaagc caggggacug aggaggacga gcggaaccug 7020 
agagccuuca cggaggccau gaccagguac ucugccccuc cuggugaucc ccccagaccg 7080 
gaauaugacc uggagcuaau aacauccugu uccucaaaug ugucuguggc guugggcccg 7140 
cggggccgcc gcagauacua ccugaccaga gacccaacca cuccacucgc ccgggcugcc 7200 
ugggaaacag uuagacacuc cccuaucaau ucauggcugg gaaacaucau ccaguaugcu 7260 
ccaaccauau ggguucgcau gguccuaaug acacacuucu ucuccauucu caugguccaa 7320 
gacacccugg accagaaccu caacuuugag auguauggau caguauacuc cgugaauccu 7380 
uuggaccuuc cagccauaau ugagagguua cacgggcuug acgccuuuuc uaugcacaca 7440 
uacucucacc acgaacugac gcggguggcu ucagcccuca gaaaacuugg ggcgccaccc 7500 
cucagggugu ggaagagucg ggcucgcgca gucagggcgu cccucaucuc ccguggaggg 7560 
aaagcggccg uuugcggccg auaucucuuc aauugggcgg ugaagaccaa gcucaaacuc 7620 
acuccauugc cggaggcgcg ccuacuggac uuauccaguu gguucaccgu cggcgccggc 7680 
gggggcgaca uuuuucacag cgugucgcgc gcccgacccc gcucauuacu cuucggccua 7740 
cuccuacuuu ucguaggggu aggccucuuc cuacuccccg cucgguagag cggcacacac 7800 
uagguacacu ccauagcuaa cuguuccuuu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu 7860 
uuuuuuuuuu cuuuuuuuuu uuuuucccuc uuucuucccu ucucaucuua uucuacuuuc 7920 
uuucuuggug gcuccaucuu agcccuaguc acggcuagcu gugaaagguc cgugagccgc 7980 
augacugcag agagugccgu aacuggucuc ucugcagauc augu 8024 
<210> 2

<211> 8024 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 



<400> 2 



acccgccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60
cuucacgcag aaagcgucua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120


cccccucccg 


ggagagccau 


aguggucugc 




aguacaccgg 




180 


aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240


caagacugcu 


agccgaguag 


cguuggguug 


cgaaaggccu 


w» ^5 *~* ^5 


w »_» £5 *-* »-» v-» ^ O 


300 


ugcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa aucccaaacc 360
ucaaagaaaa accaaaagaa acacuaaccg ucgcccaaug auugaacaag auggauugca 420
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 540
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 840
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgauaau accauggccc ccaucaccgc uuacgcccag 1800 
cagacacgag gucucuuggg cucuauagug gugagcauga cggggcguga caagacagaa 1860 
caggccgggg agguccaagu ccuguccaca gucacucagu ccuuccucgg aacauccauu 1920 
ucgggggucu uauggacugu uuaccacgga gcuggcaaca agacacuagc cggcucgcgg 1980 
ggcccgguca cgcagaugua cucgagcgcc gagggggacu uggucgggug gcccagcccu 2040 
ccugggacca aaucuuugga gccguguacg uguggagcgg ucgaccugua uuuggucacg 2100 
cggaacgcug augucauccc ggcucgaaga cgcggggaca agcggggagc gcugcucucc 2160 
ccgagacccc uuucgaccuu gaaggggucc ucggggggac cugugcuuug cccuaggggc 2220 
cacgcugucg gaaucuuccg ggcagcugug ugcucucggg guguggcuaa guccauagau 2280 
uucauccccg uugagacgcu cgacaucguc acgcggucuc ccaccuuuag ugacaacagc 2340 
acaccaccag cugugcccca gaccuaucag gugggguacu ugcacgcccc cacuggcagu 2400 
ggaaaaagca ccaagguccc cgucgcguac gccgcccagg gguauaaagu gcuggugcuc 2460 
aaucccucgg uggcugccac ccugggauuu ggggcguacu uguccaaggc acauggcauc 2520 
aaccccaaca uuaggacugg agucagaacu gugacgaccg gggagcccau uacauacucc 2580 
acguauggua aauuccucgc cgaugggggc ugcgcaggcg gcgccuauga caucaucaua 2640 
ugcgaugaau gccacucugu ggaugcuacc acuauucucg gcaucgggac aguccuugac 2700 
caagcagaga cagccggggu caggcuaacu guacuggcca cggccacgcc ccccgggucg 2760 
gugacaaccc cccaucccaa uauagaggag guagcccucg gacaggaggg ugagaucccc 2820 
uucuauggga gggcguuucc ccugucuuac aucaagggag ggaggcacuu gauuuucugc 2880 
cacucaaaga aaaaguguga cgagcucgca acggcccuuc ggggcauggg cuugaacgcu 2940 
guggcauauu acagaggguu ggacgucucc auaauaccaa cucaaggaga uguggugguc 3000
guugccaccg acgcccucau gacgggguau acuggagacu uugacuccgu gaucgacugc 3060 
aacguagcgg ucacccaggc cguagacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cgcaagacgc ugucucacgu agucagcgcc gagggcgcac ggguagagga 3180 
agacugggca uuuauaggua uguuuccacu ggugagcgag ccucaggaau guuugacagu 3240 
guaguacucu gugagugcua cgacgcagga gcugcuuggu augagcucuc accaguggag 3300 
acgaccguca ggcucagggc guauuucaac acgccuggcu ugccugugug ccaggaccac 3360 
cuugaguuuu gggaggcagu uuucaccggc cucacacaca uagacgcuca uuuccuuucc 3420 
cagacaaagc agucggggga aaauuucgca uacuuaguag ccuaucaggc cacagugugc 3480 
gccagggcca aagcgccccc cccguccugg gacgucaugu ggaagugcuu gacucgacuc 3540 
aagcccacgc uugugggccc uacaccucuc cuguaccguu ugggcucugu uaccaacgag 3600 
gucacccuua cacaccccgu gacaaaauac aucgccacau gcaugcaagc ugaccucgag 3660 
gucaugacca gcacgugggu ccuggcuggg ggagucuuag cagccgucgc cgcguauugc 3720 
uuagcgaccg gguguguuuc caucauuggc cguuuacaca ucaaccagcg agcugucguc 3780 
gcuccggaca aggagguccu cuaugaggcu uuugaugaga uggaggaaug ugccuccaga 3840 
gcggcucucc uugaagaggg gcagcggaua gccgagaugc ugaaguccaa gauccaaggc 3900 
uuauugcagc aagccucuaa acaggcccag gacauacaac ccgcugugca agcuucgugg 3960 
cccaagaugg agcaauucug ggccaaacau auguggaacu ucauaagcgg cauucaguac 4020 
cucgcaggac ugucaacacu gccagggaac ccugcugugg cuuccaugau ggcauucagc 4080 
gccgcccuca ccaguccguu gucaacuagc accaccaucc uucuuaacau ucuggggggc 4140 
uggcuggcgu cccaaauugc gccacccgcg ggggccacug gcuuuguugu caguggccug 4200 
gugggagcug cuguuggcag cauaggcuug gguaaagugc ugguggacau ccuggcaggg 4260 
uauggugcgg gcauuucggg ggcccucguc gcguuuaaga ucaugucugg cgagaagccc 4320 
uccauggagg augucaucaa cuugcugccu gggauucugu cuccaggugc ucugguggug 4380 
ggagucaucu gcgcggccau ucugcgccgc caugugggac cgggggaagg cgcgguccaa 4440 
uggaugaaca ggcuuaucgc cuucgcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gugacggagu cggaugcguc gcagcguguc acccaacugc uuggcucucu cacuauaacu 4560 
agucuacuca ggagacuuca caacuggauc acugaggauu gccccauccc augcgccggc 4620 
ucguggcucc gcgaugugug ggacuggguc uguaccaucc uaacagacuu uaagaacugg 4680 
cugaccucca agcuguuccc aaagaugccu ggccuccccu uuaucucuug ccaaaagggg 4740
uacaagggcg ugugggccgg cacuggcauc augaccacac gaugccccug cggcgccaac 4800
aucucuggca acguccgcuu gggcucuaug agaaucacag gacccaaaac cugcaugaac 4860
accuggcagg ggaccuuucc uaucaauugu uauacagaag gccagugcuu gccgaaaccc 4920
gcguuaaacu ucaagaccgc caucuggaga guggcggccu cagaguacgc ggaagugacg 4980
cagcacggau cauaugccua uauaacaggg cugaccacug acaacuuaaa agucccuugc 5040
caacuccccu cuccagaguu uuucucuugg guggacggag uacaaaucca uagguccgcc 5100
cccacaccaa agccguuuuu ccgggaugag gucucguuca gcguugggcu caauucauuu 5160
gucgucgggu cucagcuucc cugugacccu gagcccgaca cugagguagu gauguccaug 5220
cuaacagacc caucccauau cacggcggag gcugcagcgc ggcguuuagc gcggggguca 5280
cccccaucug aggcaagcuc cucagcgagc cagcugucgg cgccaucgcu gcgagccacc 5340
ugcaccaccc acgguaggac cuaugaugug gacauggugg augccaaccu guucaugggg 5400
ggcggcguga uucggauaga gucugagucc aaaguggucg uucuggacuc ccucgacuca 5460
augaccgagg aagagggcga ccuugagccu ucaguaccau cggaguauau gcuccccagg 5520
aagagguucc caccggccuu accggcuugg gcgcggccug auuacaaccc accgcuugug 5580
gaaucgugga agaggccaga uuaccaacca cccacuguug cgggcugugc ucuccccccc 5640
cccaaaaaga ccccgacgcc uccuccaagg agacgccgga cagugggucu gagcgagagc 5700
accauaggag augcccucca acagcuggcc aucaaguccu uuggccagcc ccccccaagc 5760
ggcgauucag gccuuuccac gggggcggac gccgccgacu ccggcgaucg gacacccccu 5820
gacgaguugg cucuuucgga gacagguucu accuccucca ugcccccccu cgagggggag 5880
ccuggggacc cagaccugga gccugagcag guagagcuuc aaccuccucc ccaggggggg 5940
gaggcagcuc ccggcucgga cucggggucc uggucuacuu gcuccgagga ggaugacucc 6000
gucgugugcu gcuccauguc auauuccugg accggggcuc uaauaacucc uuguagcccc 6060
gaagaggaaa aguugccaau uaacuccuug agcaacucgc uguugcgaua ccauaacaag 6120
guauacugua cuacaucaaa gagugccuca cuaagggcua aaaagguaac uuuugauagg 6180
augcaagugc ucgacgccua uuaugauuca gucuuaaagg acaucaagcu agcggccucc 6240
aaggucagcg caaggcuccu caccuuagag gaggcgugcc aauugacccc accccacucu 6300
gcaagaucca aguauggguu uggggcuaag gagguccgca gcuuguccgg gagggccguc 6360
aaccacauca aguccgugug gaaggaccuc uuggaagacu cacaaacacc aauuccuaca 6420
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaaaaaacca 6480
gcucgccuua ucguuuaccc ugaccucggc gucagggucu gcgagaagau ggcccuuuau 6540 
gaugucacac aaaagcuucc ucaggcggug augggggcuu cuuauggcuu ccaguacucc 6600 
cccgcucagc ggguggaguu ucucuugaag gcaugggcgg aaaagagaga cccuaugggu 6660 
uuuucguaug auacccgaug cuuugacuca accgucacug agagagacau caggacugag 6720 
gaguccauau accaggccug cuccuuaccc gaggaggccc gaacugccau acacucgcug 6780 
acugagagac ucuauguggg agggcccaug uucaacagca agggccaguc cugcggguac 6840 
aggcguugcc gcgccagcgg ggugcuuacc acuaguaugg ggaacaccau cacaugcuau 6900 
guaaaagccc uagcggcuug caaggcugcg gggauaauug cgcccacgau gcugguaugc 6960 
ggcgacgacu uggucgucau cucagaaagc caggggacug aggaggacga gcggaaccug 7020 
agagccuuca cggaggcuau gaccagguau ucugccccuc cuggugaccc ccccagaccg 7080 
gaauaugacc uggagcuaau aacaucuugu uccucaaacg ugucuguggc acuuggccca 7140 
cagggccgcc gcagauacua ccugaccaga gaccccacca cuucaauugc ccgggcugcc 7200 
ugggaaacag uuagacacuc cccugucaau ucauggcugg gaaacaucau ccaguacgcu 7260 
ccaaccauau ggguucgcau gguccugaug acacacuucu ucuccauucu cauggcccag 7320 
gacacccuag accagaaccu uaacuuugaa auguacggau cgguguacuc cgugaguccu 7380 
cuggaccucc cagccauaau ugaaagguua cacgggcuug acgccuucuc ucugcacaca 7440 
uacacucccc acgaacugac gcggguggcu ucagcccuca gaaaacuugg ggcgccaccc 7500 
cucagagcgu ggaagagucg ggcgcgugca guuagggcgu cccucaucuc ccgugggggg 7560 
agggcggccg uuugcggucg guaccucuuc aacugggcgg ugaagaccaa gcucaaacuc 7620 
acuccuuugc cggaggcacg ccuccuggau uuguccaguu gguuuaccgu cggcgccggc 7680 
gggggcgaca uuuaucacag cgugucgcgu gcccgacccc gccuauuacu ccuuagccua 7740 
cuccuacuuu cuguaggggu aggccucuuc cuacuccccg cucgauagag cggcacacau 7800 
uagcuacacu ccauagcuaa cuguuccuuu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu 7860 
uuuuuuuuuu cuuuuuuuuu uuuuucccuc uuucuucccu ucucaucuua uucuacuuuc 7920 
uuucuuggug gcuccaucuu agcccuaguc acggcuagcu gugaaagguc cgugagccgc 7980 
augacugcag agagugccgu aacuggucuc ucugcagauc augu 8024 
<210> 3
<211> 9678 
<212> DNA 

<213> Hepatitis C virus 

<220> 
<221> CDS 

<222> (341)..(9442)
<400> 3 

acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 

cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120 

ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 

aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240 

caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300 

cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atg agc aca aat cct 355
Met Ser Thr Asn Pro
1 5

aaa cct caa aga aaa acc aaa aga aac acc aac cgt cgc cca gaa gac 403
Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Glu Asp
10 15 20

gtt aag ttc ccg ggc ggc ggc cag atc gtt ggc gga gta tac ttg ttg 451
Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr Leu Leu
25 30 35

ccg cgc agg ggc ccc agg ttg ggt gtg cgc acg aca agg aaa act tcg 499
Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr Thr Arg Lys Thr Ser
40 45 50

gag cgg tcc cag cca cgt ggg aga cgc cag ccc atc ccc aaa gat cgg 547
Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys Asp Arg
55 60 65

cgc tcc act ggc aag gcc tgg gga aaa cca ggt cgc ccc tgg ccc cta 595
Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly Arg Pro Trp Pro Leu
70 75 80 85

tat ggg aat gag gga ctc ggc tgg gca gga tgg ctc ctg tcc ccc cga 643
Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg
90 95 100

ggc tct cgc ccc tcc tgg ggc ccc act gac ccc cgg cat agg tcg cgc 691
Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg His Arg Ser Arg
105 110 115

aac gtg ggt aaa gtc atc gac acc cta acg tgt ggc ttt gcc gac ctc 739
Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
120 125 130

atg ggg tac atc ccc gtc gta ggc gcc ccg ctt agt ggc gcc gcc aga 787
Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu Ser Gly Ala Ala Arg
135 140 145

gct gtc gcg cac ggc gtg aga gtc ctg gag gac ggg gtt aat tat gca 835
Ala Val Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
150 155 160 165

aca ggg aac cta ccc ggt ttc ccc ttt tct atc ttc ttg ctg gcc ctg 883
Thr Gly Asn Leu Pro Gly Phe Pro Phe Ser Ile Phe Leu Leu Ala Leu
170 175 180

ttg tcc tgc atc acc gtt ccg gtc tct gct gcc cag gtg aag aat acc 931
Leu Ser Cys Ile Thr Val Pro Val Ser Ala Ala Gln Val Lys Asn Thr
185 190 195

agt agc agc tac atg gtg acc aat gac tgc tcc aat gac agc atc act 979
Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Ser Asn Asp Ser Ile Thr
200 205 210

tgg cag ctc gag gct gcg gtt ctc cac gtc ccc ggg tgc gtc ccg tgc 1027
Trp Gln Leu Glu Ala Ala Val Leu His Val Pro Gly Cys Val Pro Cys
215 220 225

gag aga gtg ggg aat acg tca cgg tgt tgg gtg cca gtc tcg cca aac 1075
Glu Arg Val Gly Asn Thr Ser Arg Cys Trp Val Pro Val Ser Pro Asn
230 235 240 245

atg gct gtg cgg cag ccc ggt gcc ctc acg cag ggt ctg cgg acg cac 1123
Met Ala Val Arg Gln Pro Gly Ala Leu Thr Gln Gly Leu Arg Thr His
250 255 260

atc gat atg gtt gtg atg tcc gcc acc ttc tgc tct gct ctc tac gtg 1171
Ile Asp Met Val Val Met Ser Ala Thr Phe Cys Ser Ala Leu Tyr Val
265 270 275

ggg gac ctc tgt ggc ggg gtg atg ctc gcg gcc cag gtg ttc atc gtc 1219
Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala Gln Val Phe Ile Val
280 285 290

tcg ccg cag tac cac tgg ttt gtg caa gaa tgc aat tgc tcc atc tac 1267
Ser Pro Gln Tyr His Trp Phe Val Gln Glu Cys Asn Cys Ser Ile Tyr
295 300 305

cct ggc acc atc act gga cac cgc atg gca tgg gac atg atg atg aac 1315
Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
310 315 320 325

tgg tcg ccc acg gcc acc atg atc ctg gcg tac gtg atg cgc gtc ccc 1363
Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr Val Met Arg Val Pro
330 335 340

gag gtc atc ata gac atc gtt agc ggg gct cac tgg ggc gtc atg ttc 1411
Glu Val Ile Ile Asp Ile Val Ser Gly Ala His Trp Gly Val Met Phe
345 350 355

ggc ttg gcc tac ttc tct atg cag gga gcg tgg gcg aag gtc att gtc 1459
Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp Ala Lys Val Ile Val
360 365 370

atc ctt ctg ctg gcc gct ggg gtg gac gcg ggc acc acc acc gtt gga 1507
Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Gly Thr Thr Thr Val Gly
375 380 385

ggc gct gtt gca cgt tcc acc aac gtg att gcc ggc gtg ttc agc cat 1555
Gly Ala Val Ala Arg Ser Thr Asn Val Ile Ala Gly Val Phe Ser His
390 395 400 405

ggc cct cag cag aac att cag ctc att aac acc aac ggc agt tgg cac 1603
Gly Pro Gln Gln Asn Ile Gln Leu Ile Asn Thr Asn Gly Ser Trp His
410 415 420

atc aac cgt act gcc ttg aat tgc aat gac tcc ttg aac acc ggc ttt 1651
Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Asn Thr Gly Phe
425 430 435

ctc gcg gcc ttg ttc tac acc aac cgc ttt aac tcg tca ggg tgt cca 1699
Leu Ala Ala Leu Phe Tyr Thr Asn Arg Phe Asn Ser Ser Gly Cys Pro
440 445 450

ggg cgc ctg tcc gcc tgc cgc aac atc gag gct ttc cgg ata ggg tgg 1747
Gly Arg Leu Ser Ala Cys Arg Asn Ile Glu Ala Phe Arg Ile Gly Trp
455 460 465

ggc acc cta cag tac gag gat aat gtc acc aat cca gag gat atg agg 1795
Gly Thr Leu Gln Tyr Glu Asp Asn Val Thr Asn Pro Glu Asp Met Arg
470 475 480 485



ccg tac tgc tgg cac tac ccc cca aag ccg tgt ggc gta gtc ccc gcg 1843 
Pro Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys Gly Val Val Pro Ala
490 495 500 



agg tct gtg tgt ggc cca gtg tac tgt ttc acc ccc agc ccg gta gta 1891
Arg Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val
505 510 515

gtg ggc acg acc gac aga cgt gga gtg ccc acc tac aca tgg gga gag 1939
Val Gly Thr Thr Asp Arg Arg Gly Val Pro Thr Tyr Thr Trp Gly Glu
520 525 530

aat gag aca gat gtc ttc cta ctg aac agc acc cga ccg ccg cag ggc 1987
Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr Arg Pro Pro Gln Gly
535 540 545

tca tgg ttc ggc tgc acg tgg atg aac tcc act ggt ttc acc aag act 2035
Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr
550 555 560 565

tgt ggc gcg cca cct tgc cgc acc aga gct gac ttc aac gcc agc acg 2083
Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp Phe Asn Ala Ser Thr
570 575 580

gac ttg ttg tgc cct acg gat tgt ttt agg aag cat cct gat gcc act 2131
Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr
585 590 595

tat att aag tgt ggt tct ggg ccc tgg ctc aca cca aag tgc ctg gtc 2179
Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Lys Cys Leu Val
600 605 610

cac tac cct tac aga ctc tgg cat tac ccc tgc aca gtc aat ttt acc 2227
His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr
615 620 625

atc ttc aag ata aga atg tat gta ggg ggg gtt gag cac agg ctc acg 2275
Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Thr
630 635 640 645

gcc gca tgc aac ttc act cgt ggg gat cgc tgc gac ttg gag gac agg 2323
Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys Asp Leu Glu Asp Arg
650 655 660

gac agg agt cag ctg tct cct ctg ttg cac tct acc acg gaa tgg gcc 2371
Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser Thr Thr Glu Trp Ala
665 670 675

atc ctg ccc tgc acc tac tca gac tta ccc gct ttg tca act ggt ctt 2419
Ile Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala Leu Ser Thr Gly Leu
680 685 690

ctc cac ctt cac cag aac atc gtg gac gta caa tac atg tat ggc ctc 2467
Leu His Leu His Gln Asn Ile Val Asp Val Gln Tyr Met Tyr Gly Leu
695 700 705

tca cct gct atc aca aaa tac gtc gtt cga tgg gag tgg gtg gta ctc 2515
Ser Pro Ala Ile Thr Lys Tyr Val Val Arg Trp Glu Trp Val Val Leu
710 715 720 725

tta ttc ctg ctc tta gcg gac gcc aga gtc tgc gcc tgc ttg tgg atg 2563
Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met
730 735 740

ctc atc ttg ttg ggc cag gcc gaa gca gca ttg gag aag ttg gtc gtc 2611
Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu Glu Lys Leu Val Val
745 750 755

ttg cac gct gcg agt gcg gct aac tgc cat ggc ctc cta tat ttt gcc 2659
Leu His Ala Ala Ser Ala Ala Asn Cys His Gly Leu Leu Tyr Phe Ala
760 765 770

atc ttc ttc gtg gca gct tgg cac atc agg ggt cgg gtg gtc ccc ttg 2707
Ile Phe Phe Val Ala Ala Trp His Ile Arg Gly Arg Val Val Pro Leu
775 780 785

acc acc tat tgc ctc act ggc cta tgg ccc ttc tgc cta ctg ctc atg 2755
Thr Thr Tyr Cys Leu Thr Gly Leu Trp Pro Phe Cys Leu Leu Leu Met
790 795 800 805

gca ctg ccc cgg cag gct tat gcc tat gac gca cct gtg cac gga cag 2803
Ala Leu Pro Arg Gln Ala Tyr Ala Tyr Asp Ala Pro Val His Gly Gln
810 815 820

ata ggc gtg ggt ttg ttg ata ttg atc acc ctc ttc aca ctc acc ccg 2851
Ile Gly Val Gly Leu Leu Ile Leu Ile Thr Leu Phe Thr Leu Thr Pro
825 830 835

ggg tat aag acc ctc ctc ggc cag tgt ctg tgg tgg ttg tgc tat ctc 2899
Gly Tyr Lys Thr Leu Leu Gly Gln Cys Leu Trp Trp Leu Cys Tyr Leu
840 845 850



ctg acc ctg ggg gaa gec atg att cag gag tgg gta cca ccc atg cag 2947 

Leu Thr Leu Gly Glu Ala Met He Gin Glu Trp Val Pro Pro Met Gin 
855 860 865 

gtg cgc ggc ggc cgc gat ggc ate gcg tgg gec gtc act ata ttc tgc 2995 

Val Arg Gly Gly Arg Asp Gly He Ala Trp Ala Val Thr He Phe Cys 

870 . 875 880 885 

ccg ggt gtg gtg ttt gac att acc aaa tgg ctt ttg gcg ttg ctt ggg 3043 

Pro Gly Val Val Phe Asp He Thr Lys Trp Leu Leu Ala Leu Leu Gly 
890 895 900 

cct get tac etc tta agg gec get ttg aca cat gtg ccg tac ttc gtc 3091 

Pro Ala Tyr Leu Leu Arg Ala Ala Leu Thr His Val Pro Tyr Phe Val 
905 910 915 

aga get cac get ctg ata agg gta tgc get ttg gtg aag cag etc gcg 3139 

Arg Ala His Ala Leu lie Arg Val Cys Ala Leu Val Lys Gin Leu Ala 

920 925 930 

ggg ggt agg tat gtt cag gtg gcg eta ttg gec ctt ggc agg tgg act 3187 

Gly Gly Arg Tyr Val Gin Val Ala Leu Leu Ala Leu Gly Arg Trp Thr 
935 940 945 

ggc acc tac ate tat gac cac etc aca cct atg teg gac tgg gec get 3235 
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Gly Thr Tyr He Tyr Asp His Leu Thr . Pro Met Ser Asp Trp Ala Ala 
950 955 960 965 

age ggc ctg cgc gac tta gcg gtc gec gtg gaa ccc ate ate ttc agt 3283 
Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro He lie Phe Ser 
970 975 980 

ccg atg gag aag aag gtc ate gtc tgg gga gcg gag acg get gca tgt 3331 
Pro Met Glu Lys Lys Val He Val Trp Gly Ala Glu Thr Ala Ala Cys 
985 990 995 

ggg gac att eta cat gga ctt ccc gtg tec gec cga etc ggc cag gag 3379 
Gly Asp He Leu His Gly Leu Pro Val Ser Ala Arg Leu Gly Gin Glu 
1000 1005 1010 

atc ctc ctc ggc cca gct gat ggc tac acc tcc aag ggg tgg aag ctc 3427 
Ile Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser Lys Gly Trp Lys Leu 
1015 1020 1025 

ctt gct ccc atc act gct tat gcc cag caa aca cga ggc ctc ctg ggc 3475 
Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr Arg Gly Leu Leu Gly 
1030 1035 1040 1045 

gcc ata gtg gtg agt atg acg ggg cgt gac agg aca gaa cag gcc ggg 3523 
Ala Ile Val Val Ser Met Thr Gly Arg Asp Arg Thr Glu Gln Ala Gly 
1050 1055 1060 



gaa gtc caa atc ctg tcc aca gtc tct cag tcc ttc ctc gga aca acc 3571 
Glu Val Gln Ile Leu Ser Thr Val Ser Gln Ser Phe Leu Gly Thr Thr 
1065 1070 1075 

atc tcg ggg gtt ttg tgg act gtt tac cac gga gct ggc aac aag act 3619 
Ile Ser Gly Val Leu Trp Thr Val Tyr His Gly Ala Gly Asn Lys Thr 
1080 1085 1090 

cta gcc ggc tta cgg ggt ccg gtc acg cag atg tac tcg agt gct gag 3667 
Leu Ala Gly Leu Arg Gly Pro Val Thr Gln Met Tyr Ser Ser Ala Glu 
1095 1100 1105 

ggg gac ttg gta ggc tgg ccc agc ccc cct ggg acc aag tct ttg gag 3715 
Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu 
1110 1115 1120 1125 

ccg tgc aag tgt gga gcc gtc gac cta tat ctg gtc acg cgg aac gct 3763 
Pro Cys Lys Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn Ala 
1130 1135 1140 

gat gtc atc ccg gct cgg aga cgc ggg gac aag cgg gga gca ttg ctc 3811 
Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu 
1145 1150 1155 

tcc ccg aga ccc att tcg acc ttg aag ggg tcc tcg ggg ggg ccg gtg 3859 
Ser Pro Arg Pro Ile Ser Thr Leu Lys Gly Ser Ser Gly Gly Pro Val 
1160 1165 1170 

ctc tgc cct agg ggc cac gtc gtt ggg ctc ttc cga gca gct gtg tgc 3907 
Leu Cys Pro Arg Gly His Val Val Gly Leu Phe Arg Ala Ala Val Cys 
1175 1180 1185 

tct cgg ggc gtg gcc aaa tcc atc gat ttc atc ccc gtt gag aca ctc 3955 
Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile Pro Val Glu Thr Leu 
1190 1195 1200 1205 



gac gtt gtt aca agg tct ccc act ttc agt gac aac agc acg cca ccg 4003 
Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp Asn Ser Thr Pro Pro 
1210 1215 1220 

gct gtg ccc cag acc tat cag gtc ggg tac ttg cat gct cca act ggc 4051 
Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu His Ala Pro Thr Gly 
1225 1230 1235 

agt gga aag agc acc aag gtc cct gtc gcg tat gcc gcc cag ggg tac 4099 
Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr Ala Ala Gln Gly Tyr 
1240 1245 1250 

aaa gta cta gtg ctt aac ccc tcg gta gct gcc acc ctg ggg ttt ggg 4147 
Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 
1255 1260 1265 

gcg tac cta tcc aag gca cat ggc atc aat ccc aac att agg act gga 4195 
Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro Asn Ile Arg Thr Gly 
1270 1275 1280 1285 



gtc agg acc gtg atg acc ggg gag gcc atc acg tac tcc aca tat ggc 4243 
Val Arg Thr Val Met Thr Gly Glu Ala Ile Thr Tyr Ser Thr Tyr Gly 
1290 1295 1300 

aaa ttt ctc gcc gat ggg ggc tgc gct agc ggc gcc tat gac atc atc 4291 
Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly Ala Tyr Asp Ile Ile 
1305 1310 1315 

ata tgc gat gaa tgc cac gct gtg gat gct acc tcc att ctc ggc atc 4339 
Ile Cys Asp Glu Cys His Ala Val Asp Ala Thr Ser Ile Leu Gly Ile 
1320 1325 1330 

gga acg gtc ctt gat caa gca gag aca gcc ggg gtc aga cta act gtg 4387 
Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly Val Arg Leu Thr Val 
1335 1340 1345 

ctg gct acg gcc aca ccc ccc ggg tca gtg aca acc ccc cat ccc gat 4435 
Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asp 
1350 1355 1360 1365 

ata gaa gag gta ggc ctc ggg cgg gag ggt gag atc ccc ttc tat ggg 4483 
Ile Glu Glu Val Gly Leu Gly Arg Glu Gly Glu Ile Pro Phe Tyr Gly 
1370 1375 1380 

agg gcg att ccc cta tcc tgc atc aag gga ggg aga cac ctg att ttc 4531 
Arg Ala Ile Pro Leu Ser Cys Ile Lys Gly Gly Arg His Leu Ile Phe 
1385 1390 1395 

tgc cac tca aag aaa aag tgt gac gag ctc gcg gcg gcc ctt cgg ggc 4579 
Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Ala Leu Arg Gly 
1400 1405 1410 



atg ggc ttg aat gcc gtg gca tac tat aga ggg ttg gac gtc tcc ata 4627 
Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Ile 
1415 1420 1425 



ata cca gct cag gga gat gtg gtg gtc gtc gcc acc gac gcc ctc atg 4675 
Ile Pro Ala Gln Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met 
1430 1435 1440 1445 

acg ggg tac act gga gac ttt gac tcc gtg atc gac tgc aat gta gcg 4723 
Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Val Ala 
1450 1455 1460 

gtc acc caa gct gtc gac ttc agc ctg gac ccc acc ttc act ata acc 4771 
Val Thr Gln Ala Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Thr 
1465 1470 1475 

aca cag act gtc cca caa gac gct gtc tca cgc agt cag cgc cgc ggg 4819 
Thr Gln Thr Val Pro Gln Asp Ala Val Ser Arg Ser Gln Arg Arg Gly 
1480 1485 1490 

cgc aca ggt aga gga aga cag ggc act tat agg tat gtt tcc act ggt 4867 
Arg Thr Gly Arg Gly Arg Gln Gly Thr Tyr Arg Tyr Val Ser Thr Gly 
1495 1500 1505 

gaa cga gcc tca gga atg ttt gac agt gta gtg ctt tgt gag tgc tac 4915 
Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr 
1510 1515 1520 1525 

gac gca ggg gct gcg tgg tac gat ctc aca cca gcg gag acc acc gtc 4963 
Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro Ala Glu Thr Thr Val 
1530 1535 1540 



agg ctt aga gcg tat ttc aac acg ccc ggc cta ccc gtg tgt caa gac 5011 
Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gln Asp 
1545 1550 1555 

cat ctt gaa ttt tgg gag gca gtt ttc acc ggc ctc aca cac ata gac 5059 
His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His Ile Asp 
1560 1565 1570 

gcc cac ttc ctc tcc caa aca aag caa gcg ggg gag aac ttc gcg tac 5107 
Ala His Phe Leu Ser Gln Thr Lys Gln Ala Gly Glu Asn Phe Ala Tyr 
1575 1580 1585 

cta gta gcc tac caa gct acg gtg tgc gcc aga gcc aag gcc cct ccc 5155 
Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro 
1590 1595 1600 1605 

ccg tcc tgg gac gcc atg tgg aag tgc ctg gcc cga ctc aag cct acg 5203 
Pro Ser Trp Asp Ala Met Trp Lys Cys Leu Ala Arg Leu Lys Pro Thr 
1610 1615 1620 

ctt gcg ggc ccc aca cct ctc ctg tac cgt ttg ggc cct att acc aat 5251 
Leu Ala Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Ile Thr Asn 
1625 1630 1635 

gag gtc acc ctc aca cac cct ggg acg aag tac atc gcc aca tgc atg 5299 
Glu Val Thr Leu Thr His Pro Gly Thr Lys Tyr Ile Ala Thr Cys Met 
1640 1645 1650 

caa gct gac ctt gag gtc atg acc agc acg tgg gtc cta gct gga gga 5347 
Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly Gly 
1655 1660 1665 

gtc ctg gca gcc gtc gcc gca tat tgc ctg gcg act gga tgc gtt tcc 5395 

Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys Val Ser 
1670 1675 1680 1685 

atc atc ggc cgc ttg cac gtc aac cag cga gtc gtc gtt gcg ccg gat 5443 
Ile Ile Gly Arg Leu His Val Asn Gln Arg Val Val Val Ala Pro Asp 
1690 1695 1700 

aag gag gtc ctg tat gag gct ttt gat gag atg gag gaa tgc gcc tct 5491 

Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Cys Ala Ser 
1705 1710 1715 

agg gcg gct ctc atc gaa gag ggg cag cgg ata gcc gag atg ttg aag 5539 
Arg Ala Ala Leu Ile Glu Glu Gly Gln Arg Ile Ala Glu Met Leu Lys 
1720 1725 1730 

tcc aag atc caa ggc ttg ctg cag cag gcc tct aag cag gcc cag gac 5587 
Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser Lys Gln Ala Gln Asp 
1735 1740 1745 

ata caa ccc gct atg cag gct tca tgg ccc aaa gtg gaa caa ttt tgg 5635 
Ile Gln Pro Ala Met Gln Ala Ser Trp Pro Lys Val Glu Gln Phe Trp 
1750 1755 1760 1765 

gcc aga cac atg tgg aac ttc att agc ggc atc caa tac ctc gca gga 5683 
Ala Arg His Met Trp Asn Phe Ile Ser Gly Ile Gln Tyr Leu Ala Gly 
1770 1775 1780 

ttg tca aca ctg cca ggg aac ccc gcg gtg gct tcc atg atg gca ttc 5731 
Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe 
1785 1790 1795 

agt gcc gcc ctc acc agt ccg ttg tcg acc agt acc acc atc ctt ctc 5779 
Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr Ile Leu Leu 
1800 1805 1810 

aac atc atg gga ggc tgg tta gcg tcc cag atc gca cca ccc gcg ggg 5827 
Asn Ile Met Gly Gly Trp Leu Ala Ser Gln Ile Ala Pro Pro Ala Gly 
1815 1820 1825 

gcc acc ggc ttt gtc gtc agt ggc ctg gtg ggg gct gcc gtg ggc agc 5875 
Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser 
1830 1835 1840 1845 

ata ggc ctg ggt aag gtg ctg gtg gac atc ctg gca gga tat ggt gcg 5923 
Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu Ala Gly Tyr Gly Ala 
1850 1855 1860 

ggc att tcg ggg gcc ctc gtc gca ttc aag atc atg tct ggc gag aag 5971 
Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile Met Ser Gly Glu Lys 
1865 1870 1875 

ccc tct atg gaa gat gtc atc aat cta ctg cct ggg atc ctg tct ccg 6019 
Pro Ser Met Glu Asp Val Ile Asn Leu Leu Pro Gly Ile Leu Ser Pro 
1880 1885 1890 



gga gcc ctg gtg gtg ggg gtc atc tgc gcg gcc att ctg cgc cgc cac 6067 
Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala Ile Leu Arg Arg His 
1895 1900 1905 

gtg gga ccg ggg gag ggc gcg gtc caa tgg atg aac agg ctt att gcc 6115 
Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met Asn Arg Leu Ile Ala 
1910 1915 1920 1925 

ttt gct tcc aga gga aac cac gtc gcc cct act cac tac gtg acg gag 6163 
Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr Glu 
1930 1935 1940 

tcg gat gcg tcg cag cgt gtg acc caa cta ctt ggc tct ctt act ata 6211 
Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu Gly Ser Leu Thr Ile 
1945 1950 1955 

acc agc cta ctc aga aga ctc cac aat tgg ata act gag gac tgc ccc 6259 
Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile Thr Glu Asp Cys Pro 
1960 1965 1970 

atc cca tgc tcc gga tcc tgg ctc cgc gac gtg tgg gac tgg gtt tgc 6307 
Ile Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Val Cys 
1975 1980 1985 

acc atc ttg aca gac ttc aaa aat tgg ctg acc tct aaa ttg ttc ccc 6355 
Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro 
1990 1995 2000 2005 



aag ctg ccc ggc ctc ccc ttc atc tct tgt caa aag ggg tac aag ggt 6403 
Lys Leu Pro Gly Leu Pro Phe Ile Ser Cys Gln Lys Gly Tyr Lys Gly 
2010 2015 2020 

gtg tgg gcc ggc act ggc atc atg acc acg cgc tgc cct tgc ggc gcc 6451 
Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg Cys Pro Cys Gly Ala 
2025 2030 2035 

aac atc tct ggc aat gtc cgc ctg ggc tct atg agg atc aca ggg cct 6499 
Asn Ile Ser Gly Asn Val Arg Leu Gly Ser Met Arg Ile Thr Gly Pro 

2040 2045 2050 

aaa acc tgc atg aac acc tgg cag ggg acc ttt cct atc aat tgc tac 6547 
Lys Thr Cys Met Asn Thr Trp Gln Gly Thr Phe Pro Ile Asn Cys Tyr 

2055 2060 2065 

acg gag ggc cag tgc gcg ccg aaa ccc ccc acg aac tac aag acc gcc 6595 
Thr Glu Gly Gln Cys Ala Pro Lys Pro Pro Thr Asn Tyr Lys Thr Ala 

2070 2075 2080 2085 

atc tgg agg gtg gcg gcc tcg gag tac gcg gag gtg acg cag cat ggg 6643 
Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gln His Gly 
2090 2095 2100 

tcg tac tcc tat gta aca gga ctg acc act gac aat ctg aaa att cct 6691 
Ser Tyr Ser Tyr Val Thr Gly Leu Thr Thr Asp Asn Leu Lys Ile Pro 
2105 2110 2115 

tgc caa cta cct tct cca gag ttt ttc tcc tgg gtg gac ggt gtg cag 6739 
Cys Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gln 
2120 2125 2130 

atc cat agg ttt gca ccc aca cca aag ccg ttt ttc cgg gat gag gtc 6787 
Ile His Arg Phe Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu Val 
2135 2140 2145 

tcg ttc tgc gtt ggg ctt aat tcc tat gct gtc ggg tcc cag ctt ccc 6835 
Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val Gly Ser Gln Leu Pro 
2150 2155 2160 2165 

tgt gaa cct gag ccc gac gca gac gta ttg agg tcc atg cta aca gat 6883 

Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg Ser Met Leu Thr Asp 
2170 2175 2180 

ccg ccc cac atc acg gcg gag act gcg gcg cgg cgc ttg gca cgg gga 6931 
Pro Pro His Ile Thr Ala Glu Thr Ala Ala Arg Arg Leu Ala Arg Gly 
2185 2190 2195 

tca cct cca tct gag gcg agc tcc tca gtg agc cag cta tca gca ccg 6979 
Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser Gln Leu Ser Ala Pro 
2200 2205 2210 

tcg ctg cgg gcc acc tgc acc acc cac agc aac acc tat gac gtg gac 7027 

Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn Thr Tyr Asp Val Asp 
2215 2220 2225 

atg gtc gat gcc aac ctg ctc atg gag ggc ggt gtg gct cag aca gag 7075 
Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly Val Ala Gln Thr Glu 
2230 2235 2240 2245 

cct gag tcc agg gtg ccc gtt ctg gac ttt ctc gag cca atg gcc gag 7123 
Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu Glu Pro Met Ala Glu 
2250 2255 2260 



gaa gag agc gac ctt gag ccc tca ata cca tcg gag tgc atg ctc ccc 7171 
Glu Glu Ser Asp Leu Glu Pro Ser Ile Pro Ser Glu Cys Met Leu Pro 
2265 2270 2275 



agg agc ggg ttt cca cgg gcc tta ccg gct tgg gca cgg cct gac tac 7219 

Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr 
2280 2285 2290 

aac ccg ccg ctc gtg gaa tcg tgg agg agg cca gat tac caa ccg ccc 7267 
Asn Pro Pro Leu Val Glu Ser Trp Arg Arg Pro Asp Tyr Gln Pro Pro 
2295 2300 2305 

acc gtt gct ggt tgt gct ctc ccc ccc ccc aag aag gcc ccg acg cct 7315 

Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lys Ala Pro Thr Pro 
2310 2315 2320 2325 

ccc cca agg aga cgc cgg aca gtg ggt ctg agc gag agc acc ata tca 7363 
Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Thr Ile Ser 
2330 2335 2340 

gaa gcc ctc cag caa ctg gcc atc aag acc ttt ggc cag ccc ccc tcg 7411 
Glu Ala Leu Gln Gln Leu Ala Ile Lys Thr Phe Gly Gln Pro Pro Ser 
2345 2350 2355 



agc ggt gat gca ggc tcg tcc acg ggg gcg ggc gcc gcc gaa tcc ggc 7459 
Ser Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly Ala Ala Glu Ser Gly 
2360 2365 2370 

ggt ccg acg tcc cct ggt gag ccg gcc ccc tca gag aca ggt tcc gcc 7507 
Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser Glu Thr Gly Ser Ala 
2375 2380 2385 

tcc tct atg ccc ccc ctc gag ggg gag cct gga gat ccg gac ctg gag 7555 
Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Glu 
2390 2395 2400 2405 

tct gat cag gta gag ctt caa cct ccc ccc cag ggg ggg ggg gta get 7603 
Ser Asp Gln Val Glu Leu Gln Pro Pro Pro Gln Gly Gly Gly Val Ala 
2410 2415 2420 

ccc ggt tcg ggc tcg ggg tct tgg tct act tgc tcc gag gag gac gat 7651 
Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys Ser Glu Glu Asp Asp 
2425 2430 2435 

acc acc gtg tgc tgc tcc atg tca tac tcc tgg acc ggg gct cta ata 7699 
Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu Ile 
2440 2445 2450 

act ccc tgt agc ccc gaa gag gaa aag ttg cca atc aac cct ttg agt 7747 
Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro Ile Asn Pro Leu Ser 
2455 2460 2465 

aac tcg ctg ttg cga tac cat aac aag gtg tac tgt aca aca tca aag 7795 

Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr Cys Thr Thr Ser Lys 
2470 2475 2480 2485 

agc gcc tca cag agg gct aaa aag gta act ttt gac agg acg caa gtg 7843 
Ser Ala Ser Gln Arg Ala Lys Lys Val Thr Phe Asp Arg Thr Gln Val 
2490 2495 2500 

ctc gac gcc cat tat gac tca gtc tta aag gac atc aag cta gcg gct 7891 
Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp Ile Lys Leu Ala Ala 

2505 2510 2515 

tcc aag gtc agc gca agg ctc ctc acc ttg gag gag gcg tgc cag ttg 7939 
Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gln Leu 
2520 2525 2530 

act cca ccc cat tct gca aga tcc aag tat gga ttc ggg gcc aag gag 7987 

Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu 
2535 2540 2545 

gtc cgc agc ttg tcc ggg agg gcc gtt aac cac atc aag tcc gtg tgg 8035 
Val Arg Ser Leu Ser Gly Arg Ala Val Asn His Ile Lys Ser Val Trp 
2550 2555 2560 2565 

aag gac ctc ctg gaa gac cca caa aca cca att ccc aca acc atc atg 8083 
Lys Asp Leu Leu Glu Asp Pro Gln Thr Pro Ile Pro Thr Thr Ile Met 
2570 2575 2580 

gcc aaa aat gag gtg ttc tgc gtg gac ccc gcc aag ggg ggt aag aaa 8131 
Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala Lys Gly Gly Lys Lys 
2585 2590 2595 

cca gct cgc ctc atc gtt tac cct gac ctc ggc gtc cgg gtc tgc gag 8179 
Pro Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu 
2600 2605 2610 

aaa atg gcc ctc tat gac att aca caa aag ctt cct cag gcg gta atg 8227 
Lys Met Ala Leu Tyr Asp Ile Thr Gln Lys Leu Pro Gln Ala Val Met 
2615 2620 2625 

gga gct tcc tat ggc ttc cag tac tcc cct gcc caa cgg gtg gag tat 8275 
Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala Gln Arg Val Glu Tyr 
2630 2635 2640 2645 

ctc ttg aaa gca tgg gcg gaa aag aag gac ccc atg ggt ttt tcg tat 8323 
Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro Met Gly Phe Ser Tyr 
2650 2655 2660 

gat acc cga tgc ttc gac tca acc gtc act gag aga gac atc agg acc 8371 
Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg Asp Ile Arg Thr 
2665 2670 2675 

gag gag tcc ata tac cag gcc tgc tcc ctg ccc gag gag gcc cgc act 8419 
Glu Glu Ser Ile Tyr Gln Ala Cys Ser Leu Pro Glu Glu Ala Arg Thr 
2680 2685 2690 

gcc ata cac tcg ctg act gag aga ctt tac gta gga ggg ccc atg ttc 8467 
Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Phe 
2695 2700 2705 

aac agc aag ggt caa acc tgc ggt tac aga cgt tgc cgc gcc agc ggg 8515 
Asn Ser Lys Gly Gln Thr Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly 
2710 2715 2720 2725 

gtg cta acc act agc atg ggt aac acc atc aca tgc tat gtg aaa gcc 8563 
Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr Cys Tyr Val Lys Ala 
2730 2735 2740 

cta gcg gcc tgc aag gct gcg ggg ata gtt gcg ccc aca atg ctg gta 8611 
Leu Ala Ala Cys Lys Ala Ala Gly Ile Val Ala Pro Thr Met Leu Val 
2745 2750 2755 

tgc ggc gat gac cta gta gtc atc tca gaa agc cag ggg act gag gag 8659 
Cys Gly Asp Asp Leu Val Val Ile Ser Glu Ser Gln Gly Thr Glu Glu 
2760 2765 2770 

gac gag cgg aac ctg aga gcc ttc acg gag gcc atg acc agg tac tct 8707 

Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser 
2775 2780 2785 

gcc cct cct ggt gat ccc ccc aga ccg gaa tat gac ctg gag cta ata 8755 
Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu Ile 
2790 2795 2800 2805 



aca tcc tgt tcc tca aat gtg tct gtg gcg ttg ggc ccg cgg ggc cgc 8803 
Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu Gly Pro Arg Gly Arg 
2810 2815 2820 

cgc aga tac tac ctg acc aga gac cca acc act cca ctc gcc cgg gct 8851 
Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala 
2825 2830 2835 

gcc tgg gaa aca gtt aga cac tcc cct atc aat tca tgg ctg gga aac 8899 
Ala Trp Glu Thr Val Arg His Ser Pro Ile Asn Ser Trp Leu Gly Asn 
2840 2845 2850 

atc atc cag tat gct cca acc ata tgg gtt cgc atg gtc cta atg aca 8947 
Ile Ile Gln Tyr Ala Pro Thr Ile Trp Val Arg Met Val Leu Met Thr 
2855 2860 2865 

cac ttc ttc tcc att ctc atg gtc caa gac acc ctg gac cag aac ctc 8995 
His Phe Phe Ser Ile Leu Met Val Gln Asp Thr Leu Asp Gln Asn Leu 
2870 2875 2880 2885 

aac ttt gag atg tat gga tca gta tac tcc gtg aat cct ttg gac ctt 9043 
Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val Asn Pro Leu Asp Leu 
2890 2895 2900 

cca gcc ata att gag agg tta cac ggg ctt gac gcc ttt tct atg cac 9091 
Pro Ala Ile Ile Glu Arg Leu His Gly Leu Asp Ala Phe Ser Met His 
2905 2910 2915 



aca tac tct cac cac gaa ctg acg cgg gtg gct tca gcc ctc aga aaa 9139 
Thr Tyr Ser His His Glu Leu Thr Arg Val Ala Ser Ala Leu Arg Lys 
2920 2925 2930 

ctt ggg gcg cca ccc ctc agg gtg tgg aag agt cgg gct cgc gca gtc 9187 
Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser Arg Ala Arg Ala Val 
2935 2940 2945 

agg gcg tcc ctc atc tcc cgt gga ggg aaa gcg gcc gtt tgc ggc cga 9235 
Arg Ala Ser Leu Ile Ser Arg Gly Gly Lys Ala Ala Val Cys Gly Arg 
2950 2955 2960 2965 

tat ctc ttc aat tgg gcg gtg aag acc aag ctc aaa ctc act cca ttg 9283 
Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu 
2970 2975 2980 

ccg gag gcg cgc cta ctg gac tta tcc agt tgg ttc acc gtc ggc gcc 9331 
Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp Phe Thr Val Gly Ala 
2985 2990 2995 

ggc ggg ggc gac att ttt cac agc gtg tcg cgc gcc cga ccc cgc tca 9379 
Gly Gly Gly Asp Ile Phe His Ser Val Ser Arg Ala Arg Pro Arg Ser 
3000 3005 3010 

tta ctc ttc ggc cta ctc cta ctt ttc gta ggg gta ggc ctc ttc cta 9427 
Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly Val Gly Leu Phe Leu 
3015 3020 3025 

ctc ccc gct cgg tag agcggcacac actaggtaca ctccatagct aactgttcct 9482 
Leu Pro Ala Arg 
3030 

tttttttttt tttttttttt tttttttttt tttttttttt ttcttttttt tttttttccc 9542 
tctttcttcc cttctcatct tattctactt tctttcttgg tggctccatc ttagccctag 9602 
tcacggctag ctgtgaaagg tccgtgagcc gcatgactgc agagagtgcc gtaactggtc 9662 
tctctgcaga tcatgt 9678 

<210> 4 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus 
<400> 4 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 

1 5 10 15 

Arg Arg Pro Glu Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr 

35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro 

50 55 60 

Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly 
65 70 75 80 

Arg Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 

85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys 

115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu 

130 135 140 

Ser Gly Ala Ala Arg Ala Val Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Phe Pro Phe Ser Ile 

165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr Val Pro Val Ser Ala Ala 

180 185 190 

Gln Val Lys Asn Thr Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Ser 

195 200 205 

Asn Asp Ser Ile Thr Trp Gln Leu Glu Ala Ala Val Leu His Val Pro 

210 215 220 

Gly Cys Val Pro Cys Glu Arg Val Gly Asn Thr Ser Arg Cys Trp Val 
225 230 235 240 

Pro Val Ser Pro Asn Met Ala Val Arg Gln Pro Gly Ala Leu Thr Gln 

245 250 255 

Gly Leu Arg Thr His Ile Asp Met Val Val Met Ser Ala Thr Phe Cys 

260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 

275 280 285 

Gln Val Phe Ile Val Ser Pro Gln Tyr His Trp Phe Val Gln Glu Cys 

290 295 300 

Asn Cys Ser Ile Tyr Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr 
325 330 335 

Val Met Arg Val Pro Glu Val Ile Ile Asp Ile Val Ser Gly Ala His 

340 345 350 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp 

355 360 365 

Ala Lys Val Ile Val Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Gly 

370 375 380 

Thr Thr Thr Val Gly Gly Ala Val Ala Arg Ser Thr Asn Val Ile Ala 
385 390 395 400 

Gly Val Phe Ser His Gly Pro Gln Gln Asn Ile Gln Leu Ile Asn Thr 

405 410 415 

Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 

420 425 430 

Leu Asn Thr Gly Phe Leu Ala Ala Leu Phe Tyr Thr Asn Arg Phe Asn 

435 440 445 

Ser Ser Gly Cys Pro Gly Arg Leu Ser Ala Cys Arg Asn Ile Glu Ala 

450 455 460 

Phe Arg Ile Gly Trp Gly Thr Leu Gln Tyr Glu Asp Asn Val Thr Asn 
465 470 475 480 

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys 

485 490 495 

Gly Val Val Pro Ala Arg Ser Val Cys Gly Pro Val Tyr Cys Phe Thr 

500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Arg Gly Val Pro Thr 

515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 

530 535 540 

Arg Pro Pro Gln Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr 
545 550 555 560 

Gly Phe Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 

580 585 590 

His Pro Asp Ala Thr Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr 

595 600 605 

Pro Lys Cys Leu Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 

610 615 620 

Thr Val Asn Phe Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 

645 650 655 

Asp Leu Glu Asp Arg Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser 

660 665 670 

Thr Thr Glu Trp Ala Ile Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala 

675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln 

690 695 700 

Tyr Met Tyr Gly Leu Ser Pro Ala Ile Thr Lys Tyr Val Val Arg Trp 
705 710 715 720 

Glu Trp Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 

725 730 735 

Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu 

740 745 750 

Glu Lys Leu Val Val Leu His Ala Ala Ser Ala Ala Asn Cys His Gly 

755 760 765 

Leu Leu Tyr Phe Ala Ile Phe Phe Val Ala Ala Trp His Ile Arg Gly 

770 775 780 

Arg Val Val Pro Leu Thr Thr Tyr Cys Leu Thr Gly Leu Trp Pro Phe 
785 790 795 800 

Cys Leu Leu Leu Met Ala Leu Pro Arg Gln Ala Tyr Ala Tyr Asp Ala 

805 810 815 

Pro Val His Gly Gln Ile Gly Val Gly Leu Leu Ile Leu Ile Thr Leu 

820 825 830 

Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Gly Gln Cys Leu Trp 

835 840 845 

Trp Leu Cys Tyr Leu Leu Thr Leu Gly Glu Ala Met Ile Gln Glu Trp 

850 855 860 

Val Pro Pro Met Gln Val Arg Gly Gly Arg Asp Gly Ile Ala Trp Ala 
865 870 875 880 

Val Thr Ile Phe Cys Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu 

885 890 895 

Leu Ala Leu Leu Gly Pro Ala Tyr Leu Leu Arg Ala Ala Leu Thr His 

900 905 910 

Val Pro Tyr Phe Val Arg Ala His Ala Leu Ile Arg Val Cys Ala Leu 

915 920 925 

Val Lys Gln Leu Ala Gly Gly Arg Tyr Val Gln Val Ala Leu Leu Ala 

930 935 940 

Leu Gly Arg Trp Thr Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met 
945 950 955 960 

Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu 

965 970 975 

Pro Ile Ile Phe Ser Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala 

980 985 990 

Glu Thr Ala Ala Cys Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala 

995 1000 1005 

Arg Leu Gly Gln Glu Ile Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser 

1010 1015 1020 

Lys Gly Trp Lys Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Ala Ile Val Val Ser Met Thr Gly Arg Asp Arg 

1045 1050 1055 

Thr Glu Gln Ala Gly Glu Val Gln Ile Leu Ser Thr Val Ser Gln Ser 

1060 1065 1070 

Phe Leu Gly Thr Thr Ile Ser Gly Val Leu Trp Thr Val Tyr His Gly 

1075 1080 1085 

Ala Gly Asn Lys Thr Leu Ala Gly Leu Arg Gly Pro Val Thr Gln Met 

1090 1095 1100 

Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly 
1105 1110 1115 1120 

Thr Lys Ser Leu Glu Pro Cys Lys Cys Gly Ala Val Asp Leu Tyr Leu 

1125 1130 1135 

Val Thr Arg Asn Ala Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys 

1140 1145 1150 

Arg Gly Ala Leu Leu Ser Pro Arg Pro Ile Ser Thr Leu Lys Gly Ser 

1155 1160 1165 

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Val Val Gly Leu Phe 

1170 1175 1180 

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile 
1185 1190 1195 1200 

Pro Val Glu Thr Leu Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp 

1205 1210 1215 

Asn Ser Thr Pro Pro Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu 

1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr 

1235 1240 1245 

Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 
1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro 
1265 1270 1275 1280 

Asn Ile Arg Thr Gly Val Arg Thr Val Met Thr Gly Glu Ala Ile Thr 

1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly 

1300 1305 1310 

Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ala Val Asp Ala Thr 

1315 1320 1325 

Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly 

1330 1335 1340 

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Thr Pro His Pro Asp Ile Glu Glu Val Gly Leu Gly Arg Glu Gly Glu 

1365 1370 1375 

Ile Pro Phe Tyr Gly Arg Ala Ile Pro Leu Ser Cys Ile Lys Gly Gly 

1380 1385 1390 

Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 

1395 1400 1405 

Ala Ala Leu Arg Gly Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly 

1410 1415 1420 

Leu Asp Val Ser Ile Ile Pro Ala Gln Gly Asp Val Val Val Val Ala 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile 

1445 1450 1455 

Asp Cys Asn Val Ala Val Thr Gln Ala Val Asp Phe Ser Leu Asp Pro 

1460 1465 1470 

Thr Phe Thr Ile Thr Thr Gln Thr Val Pro Gln Asp Ala Val Ser Arg 

1475 1480 1485 

Ser Gln Arg Arg Gly Arg Thr Gly Arg Gly Arg Gln Gly Thr Tyr Arg 
1490 1495 1500 

Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro 

1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 

1540 1545 1550 

Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly 

1555 1560 1565 

Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln Ala Gly 

1570 1575 1580 

Glu Asn Phe Ala Tyr Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Lys Ala Pro Pro Pro Ser Trp Asp Ala Met Trp Lys Cys Leu Ala 

1605 1610 1615 

Arg Leu Lys Pro Thr Leu Ala Gly Pro Thr Pro Leu Leu Tyr Arg Leu 

1620 1625 1630 

Gly Pro Ile Thr Asn Glu Val Thr Leu Thr His Pro Gly Thr Lys Tyr 

1635 1640 1645 

Ile Ala Thr Cys Met Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp 

1650 1655 1660 

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys Val Ser Ile Ile Gly Arg Leu His Val Asn Gln Arg Val 

1685 1690 1695 

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met 

1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu Ile Glu Glu Gly Gln Arg Ile 
1715 1720 1725 

Ala Glu Met Leu Lys Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser 

1730 1735 1740 

Lys Gln Ala Gln Asp Ile Gln Pro Ala Met Gln Ala Ser Trp Pro Lys 
1745 1750 1755 1760 

Val Glu Gln Phe Trp Ala Arg His Met Trp Asn Phe Ile Ser Gly Ile 

1765 1770 1775 

Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala 

1780 1785 1790 

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser 

1795 1800 1805 

Thr Thr Ile Leu Leu Asn Ile Met Gly Gly Trp Leu Ala Ser Gln Ile 

1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly 
1825 1830 1835 1840 

Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu 

1845 1850 1855 

Ala Gly Tyr Gly Ala Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile 

1860 1865 1870 

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val Ile Asn Leu Leu Pro 

1875 1880 1885 

Gly Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala 

1890 1895 1900 

Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met 
1905 1910 1915 1920 

Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr 

1925 1930 1935 

His Tyr Val Thr Glu Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu 

1940 1945 1950 

Gly Ser Leu Thr Ile Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile 
1955 1960 1965 

Thr Glu Asp Cys Pro Ile Pro Cys Ser Gly Ser Trp Leu Arg Asp Val 

1970 1975 1980 

Trp Asp Trp Val Cys Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr 
1985 1990 1995 2000 

Ser Lys Leu Phe Pro Lys Leu Pro Gly Leu Pro Phe Ile Ser Cys Gln 

2005 2010 2015 

Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg 

2020 2025 2030 

Cys Pro Cys Gly Ala Asn Ile Ser Gly Asn Val Arg Leu Gly Ser Met 

2035 2040 2045 

Arg Ile Thr Gly Pro Lys Thr Cys Met Asn Thr Trp Gln Gly Thr Phe 

2050 2055 2060 

Pro Ile Asn Cys Tyr Thr Glu Gly Gln Cys Ala Pro Lys Pro Pro Thr 
2065 2070 2075 2080 

Asn Tyr Lys Thr Ala Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu 

2085 2090 2095 

Val Thr Gln His Gly Ser Tyr Ser Tyr Val Thr Gly Leu Thr Thr Asp 

2100 2105 2110 

Asn Leu Lys He Pro Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp 

2115 2120 2125 

Val Asp Gly Val Gin He His Arg Phe Ala Pro Thr Pro Lys Pro Phe 

2130 2135 2140 

Phe Arg Asp Glu Val Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val 
2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg 

2165 2170 2175 

Ser Met Leu Thr Asp Pro Pro His Ile Thr Ala Glu Thr Ala Ala Arg
2180 2185 2190

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser
2195 2200 2205

Gln Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn
2210 2215 2220

Thr Tyr Asp Val Asp Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly
2225 2230 2235 2240

Val Ala Gln Thr Glu Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu
2245 2250 2255

Glu Pro Met Ala Glu Glu Glu Ser Asp Leu Glu Pro Ser Ile Pro Ser
2260 2265 2270

Glu Cys Met Leu Pro Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala Trp
2275 2280 2285

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Arg Arg Pro
2290 2295 2300

Asp Tyr Gln Pro Pro Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys
2305 2310 2315 2320

Lys Ala Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser
2325 2330 2335

Glu Ser Thr Ile Ser Glu Ala Leu Gln Gln Leu Ala Ile Lys Thr Phe
2340 2345 2350

Gly Gln Pro Pro Ser Ser Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly
2355 2360 2365

Ala Ala Glu Ser Gly Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser
2370 2375 2380

Glu Thr Gly Ser Ala Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly
2385 2390 2395 2400

Asp Pro Asp Leu Glu Ser Asp Gln Val Glu Leu Gln Pro Pro Pro Gln
2405 2410 2415

Gly Gly Gly Val Ala Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys
2420 2425 2430

Ser Glu Glu Asp Asp Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp
2435 2440 2445

Thr Gly Ala Leu Ile Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro
2450 2455 2460

Ile Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr
2465 2470 2475 2480

Cys Thr Thr Ser Lys Ser Ala Ser Gln Arg Ala Lys Lys Val Thr Phe
2485 2490 2495

Asp Arg Thr Gln Val Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp
2500 2505 2510

Ile Lys Leu Ala Ala Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu
2515 2520 2525

Glu Ala Cys Gln Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly
2530 2535 2540

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His
2545 2550 2555 2560

Ile Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Pro Gln Thr Pro Ile
2565 2570 2575

Pro Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala
2580 2585 2590

Lys Gly Gly Lys Lys Pro Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly
2595 2600 2605

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Ile Thr Gln Lys Leu
2610 2615 2620

Pro Gln Ala Val Met Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala
2625 2630 2635 2640

Gln Arg Val Glu Tyr Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro
2645 2650 2655

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu
2660 2665 2670

Arg Asp Ile Arg Thr Glu Glu Ser Ile Tyr Gln Ala Cys Ser Leu Pro
2675 2680 2685

Glu Glu Ala Arg Thr Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val
2690 2695 2700

Gly Gly Pro Met Phe Asn Ser Lys Gly Gln Thr Cys Gly Tyr Arg Arg
2705 2710 2715 2720

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr
2725 2730 2735

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly Ile Val Ala
2740 2745 2750

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Ser Glu Ser
2755 2760 2765

Gln Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala
2770 2775 2780

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr
2785 2790 2795 2800

Asp Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu
2805 2810 2815

Gly Pro Arg Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr
2820 2825 2830

Pro Leu Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Ile Asn
2835 2840 2845

Ser Trp Leu Gly Asn Ile Ile Gln Tyr Ala Pro Thr Ile Trp Val Arg
2850 2855 2860

Met Val Leu Met Thr His Phe Phe Ser Ile Leu Met Val Gln Asp Thr
2865 2870 2875 2880

Leu Asp Gln Asn Leu Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val
2885 2890 2895

Asn Pro Leu Asp Leu Pro Ala Ile Ile Glu Arg Leu His Gly Leu Asp
2900 2905 2910

Ala Phe Ser Met His Thr Tyr Ser His His Glu Leu Thr Arg Val Ala
2915 2920 2925

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser
2930 2935 2940

Arg Ala Arg Ala Val Arg Ala Ser Leu Ile Ser Arg Gly Gly Lys Ala
2945 2950 2955 2960

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu
2965 2970 2975

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp
2980 2985 2990

Phe Thr Val Gly Ala Gly Gly Gly Asp Ile Phe His Ser Val Ser Arg
2995 3000 3005

Ala Arg Pro Arg Ser Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly
3010 3015 3020

Val Gly Leu Phe Leu Leu Pro Ala Arg
3025 3030



<210> 5 

<211> 9674 

<212> DNA 

<213> Hepatitis C virus 

<220> 
<221> CDS 

<222> (341)..(9442)

<400> 5

acccgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 

cttcacgcag aaagcgtcta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120 

ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 

aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240 

caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300 

tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atg agc aca aat ccc 355
Met Ser Thr Asn Pro
1 5

aaa cct caa aga aaa acc aaa aga aac act aac cgt cgc cca caa gac 403
Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Gln Asp
10 15 20

gtt aag ttt ccg ggc ggc ggc cag atc gtt ggc gga gta tac ttg ttg 451
Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr Leu Leu
25 30 35

ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg aca agg aag gct tcg 499
Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys Ala Ser
40 45 50

gag cgg tcc cag cca cgt ggg agg cgc cag ccc atc ccc aaa cat cgg 547
Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys His Arg
55 60 65

cgc tcc act ggc aag tcc tgg ggg aag cca gga tac ccc tgg ccc ctg 595
Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly Tyr Pro Trp Pro Leu
70 75 80 85

tat ggg aat gag ggg ctc ggt tgg gca gga tgg ctc ctg tcc cct cga 643
Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg
90 95 100

ggt tcc cgt ccc tca tgg ggc ccc aat gac ccc cgg cat agg tcg cgc 691
Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro Arg His Arg Ser Arg
105 110 115

aat gtg ggt aag gtc atc gat acc cta acg tgc ggc ttt gcc gac ctc 739
Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
120 125 130

ttg ggg tac gtc ccc gtc gta ggc gcc ccg ctt agt ggc gtt gcc agt 787
Leu Gly Tyr Val Pro Val Val Gly Ala Pro Leu Ser Gly Val Ala Ser
135 140 145

gct ctc gcg cac ggc gtg aga gtc ctg gag gac ggg gtt aat ttt gca 835
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Phe Ala
150 155 160 165



aca ggg aac tta cct ggt tgc tcc ttt tct atc ttc ttg ctg gcc cta 883
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
170 175 180



ctg tcc tgc atc act act ccg gtc tct gct gtc caa gtg aag aac acc 931
Leu Ser Cys Ile Thr Thr Pro Val Ser Ala Val Gln Val Lys Asn Thr
185 190 195

agc aac gcc tat atg gcg act aac gac tgt tcc aat gac agc atc act 979
Ser Asn Ala Tyr Met Ala Thr Asn Asp Cys Ser Asn Asp Ser Ile Thr
200 205 210

tgg cag ctt gag gcc gca gtc ctc cat gtc ccc ggg tgc gtc ccg tgc 1027
Trp Gln Leu Glu Ala Ala Val Leu His Val Pro Gly Cys Val Pro Cys
215 220 225

gag aaa atg ggg aac aca tca cgg tgc tgg ata cca gtc tca cca aac 1075
Glu Lys Met Gly Asn Thr Ser Arg Cys Trp Ile Pro Val Ser Pro Asn
230 235 240 245

gtg gct gtg cgg cag cct ggc gcc ctc acg cgg ggc ttg cgg acg cac 1123
Val Ala Val Arg Gln Pro Gly Ala Leu Thr Arg Gly Leu Arg Thr His
250 255 260

atc gac atg gtc gtg ttg tcc gcc acg ctc tgc tcc gct ctc tac gtg 1171
Ile Asp Met Val Val Leu Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val
265 270 275



ggg gac ctc tgt ggc ggg gtg atg ctc gcg tcc cag atg ttc att gtc 1219
Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ser Gln Met Phe Ile Val
280 285 290

tcg ccg cag cac cac tgg ttc gtg cag gaa tgc aat tgc tcc atc tac 1267
Ser Pro Gln His His Trp Phe Val Gln Glu Cys Asn Cys Ser Ile Tyr
295 300 305

cct ggc gcc atc act ggg cac cgt atg gca tgg gac atg atg atg aac 1315
Pro Gly Ala Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
310 315 320 325

tgg tcg ccc acg acc acc atg atc ctg gcg tac gtg atg cgc gtt ccc 1363
Trp Ser Pro Thr Thr Thr Met Ile Leu Ala Tyr Val Met Arg Val Pro
330 335 340

gag gtc atc ata gac atc att agc gga gct cac tgg ggc gtc atg ttt 1411
Glu Val Ile Ile Asp Ile Ile Ser Gly Ala His Trp Gly Val Met Phe
345 350 355

ggc ctg gcc tac ttc tct atg cag gga gcg tgg gcg aag gtc gtt gtc 1459
Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp Ala Lys Val Val Val
360 365 370

atc ctc ctg ctg gcc tct ggg gtg gac gcg tac acc acc acg act ggg 1507
Ile Leu Leu Leu Ala Ser Gly Val Asp Ala Tyr Thr Thr Thr Thr Gly
375 380 385

agc gct gct ggg cgc act acc agt agc ctg gcc agc gcc ttc tcc cct 1555
Ser Ala Ala Gly Arg Thr Thr Ser Ser Leu Ala Ser Ala Phe Ser Pro
390 395 400 405

ggc gct cgg cag aac att cag ctc att aat acc aat ggt agc tgg cac 1603
Gly Ala Arg Gln Asn Ile Gln Leu Ile Asn Thr Asn Gly Ser Trp His
410 415 420

atc aac cgc acc gcc ctg aat tgc aac gat tcc ttg cac acc ggc ttc 1651
Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu His Thr Gly Phe
425 430 435

ttc acg gcc ctg ttc tac atc cat aag ttc aac tcg tcg gga tgt ccc 1699
Phe Thr Ala Leu Phe Tyr Ile His Lys Phe Asn Ser Ser Gly Cys Pro
440 445 450

gag cgc ctg tcc gcc tgt cgc aac atc gag gac ttc cgg ata gga tgg 1747
Glu Arg Leu Ser Ala Cys Arg Asn Ile Glu Asp Phe Arg Ile Gly Trp
455 460 465

ggc gcc ctg caa tac gac gac aat gtc acc aat cca gaa gat atg agg 1795
Gly Ala Leu Gln Tyr Asp Asp Asn Val Thr Asn Pro Glu Asp Met Arg
470 475 480 485

cca tat tgc tgg cac tac cca cca aaa cag tgt ggc gta gtc ccc gca 1843
Pro Tyr Cys Trp His Tyr Pro Pro Lys Gln Cys Gly Val Val Pro Ala
490 495 500

ggg acc gtg tgc ggc cca gtg tac tgt ttc acc cct agc ccg gtg gta 1891
Gly Thr Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val
505 510 515

gtg ggc acg acc gat aga ctt gga gtg cct act tac acg tgg gga gag 1939
Val Gly Thr Thr Asp Arg Leu Gly Val Pro Thr Tyr Thr Trp Gly Glu
520 525 530



aat gag aca gat gtc ttc cta ttg aac agc acc cga cca ccg tcg ggg 1987
Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr Arg Pro Pro Ser Gly
535 540 545

tca tgg ttt ggc tgc acg tgg atg aac tcc act ggc ttc acc aag acc 2035
Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr
550 555 560 565

tgc ggc gca cca ccc tgc cgc act aga gct gac ttc aat acc agc aca 2083
Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp Phe Asn Thr Ser Thr
570 575 580

gat ctg ttg tgc ccc acg gac tgt ttt aga aaa cat cct gaa gcc act 2131
Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr
585 590 595

tac atc aaa tgt ggt tcc ggg cct tgg ctc acg cca aag tgt ctg gtt 2179
Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Lys Cys Leu Val
600 605 610

gac tac ccc tac agg ctc tgg cat tac cct tgc aca gtc aat tac tcc 2227
Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Tyr Ser
615 620 625

acc ttc aag atc agg atg tat gtg ggg gga gtt gag cac agg ctc atg 2275
Thr Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Met
630 635 640 645



gcc gcg tgc aat ttc act cgt ggg gat cgc tgc aac ttg gag gat agg 2323 
Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys Asn Leu Glu Asp Arg 
650 655 660 



gac aga agt caa cag act cct ctg ttg cac tcc acc acg gaa tgg gcc 2371
Asp Arg Ser Gln Gln Thr Pro Leu Leu His Ser Thr Thr Glu Trp Ala
665 670 675

att ttg ccc tgc tct ttc tca gac ttg ccc gct ttg tcg act ggt ctt 2419
Ile Leu Pro Cys Ser Phe Ser Asp Leu Pro Ala Leu Ser Thr Gly Leu
680 685 690

ctc cac ctc cac caa aat atc gtg gac gta caa tat atg tat ggc ctg 2467
Leu His Leu His Gln Asn Ile Val Asp Val Gln Tyr Met Tyr Gly Leu
695 700 705

tca cct gcc ctc aca caa tat atc gtt cga tgg gag tgg gta gta ctc 2515
Ser Pro Ala Leu Thr Gln Tyr Ile Val Arg Trp Glu Trp Val Val Leu
710 715 720 725



tta ttc ctg ctc cta gcg gac gcc agg gtc tgc gcc tgc ttg tgg atg 2563
Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met
730 735 740



ctc atc ttg ctg ggc caa gcc gaa gca gca ctg gag aag ctg gtc gtc 2611
Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu Glu Lys Leu Val Val
745 750 755

ttg cac gct gcg agc gca gct agc tgc aat ggc ttc ctg tat ttt gtc 2659
Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly Phe Leu Tyr Phe Val
760 765 770

atc ttt ctc gtg gct gct tgg cac atc aag ggt agg gtg gtc ccc ttg 2707
Ile Phe Leu Val Ala Ala Trp His Ile Lys Gly Arg Val Val Pro Leu
775 780 785

gct gct tat tcc ctt act ggc ctg tgg ccg ttc tgc cta ctg ctc cta 2755
Ala Ala Tyr Ser Leu Thr Gly Leu Trp Pro Phe Cys Leu Leu Leu Leu
790 795 800 805

gca ctg ccc cag cag gct tac gcc tat gat gca tct gtg cac gga cag 2803
Ala Leu Pro Gln Gln Ala Tyr Ala Tyr Asp Ala Ser Val His Gly Gln
810 815 820

gtg ggc gcg gct ttg cta gta ctg att acc ctc ttt aca ctc acc ccg 2851
Val Gly Ala Ala Leu Leu Val Leu Ile Thr Leu Phe Thr Leu Thr Pro
825 830 835

ggg tat aag acc ctt ctc agc cag tcc ctg tgg tgg ttg tgc tat ctc 2899
Gly Tyr Lys Thr Leu Leu Ser Gln Ser Leu Trp Trp Leu Cys Tyr Leu
840 845 850

ctg acc ctg gcg gaa acc atg gtc cag gag tgg gca cca tcc atg cag 2947
Leu Thr Leu Ala Glu Thr Met Val Gln Glu Trp Ala Pro Ser Met Gln
855 860 865

gcg cgc ggc ggc cgt gat ggc atc ata tgg gcc gcc acc ata ttt tgc 2995
Ala Arg Gly Gly Arg Asp Gly Ile Ile Trp Ala Ala Thr Ile Phe Cys
870 875 880 885



ccg ggc gta gtg ttt gac ata acc aag tgg ctc tta gcg gtg ctt ggg 3043
Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu Leu Ala Val Leu Gly
890 895 900

cct ggt tac ctc cta aga ggt gct ttg acg cgc gtg cca tat ttc gtc 3091
Pro Gly Tyr Leu Leu Arg Gly Ala Leu Thr Arg Val Pro Tyr Phe Val
905 910 915

aga gcc cac gct ctg ctg aga atg tgc act atg gtg agg cac ctc gcg 3139
Arg Ala His Ala Leu Leu Arg Met Cys Thr Met Val Arg His Leu Ala
920 925 930

ggg ggt agg tac gtc cag atg gcg cta tta gcc ctt ggc agg tgg act 3187
Gly Gly Arg Tyr Val Gln Met Ala Leu Leu Ala Leu Gly Arg Trp Thr
935 940 945

ggc act tac atc tat gac cac ctc acc cct atg tcg gat tgg gct gct 3235
Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met Ser Asp Trp Ala Ala
950 955 960 965

agc ggc ctg cgg gac ttg gcg gtc gct gtg gag cct atc atc ttc agt 3283
Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Ile Ile Phe Ser
970 975 980


ccg atg gag aag aaa gtc atc gtt tgg gga gcg gag acg gct gcg tgc 3331
Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala Glu Thr Ala Ala Cys
985 990 995

ggg gac atc ttg cac gga ctt ccc gtg tcc gcc cga ctc ggt cgg gag 3379
Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala Arg Leu Gly Arg Glu
1000 1005 1010

atc ctc ctt ggc cca gct gat ggc tac acc tcc aag ggg tgg aag ctt 3427
Ile Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser Lys Gly Trp Lys Leu
1015 1020 1025

ctc gcc ccc atc acc gct tac gcc cag cag aca cga ggt ctc ttg ggc 3475
Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr Arg Gly Leu Leu Gly
1030 1035 1040 1045

tct ata gtg gtg agc atg acg ggg cgt gac aag aca gaa cag gcc ggg 3523
Ser Ile Val Val Ser Met Thr Gly Arg Asp Lys Thr Glu Gln Ala Gly
1050 1055 1060

gag gtc caa gtc ctg tcc aca gtc act cag tcc ttc ctc gga aca tcc 3571
Glu Val Gln Val Leu Ser Thr Val Thr Gln Ser Phe Leu Gly Thr Ser
1065 1070 1075

att tcg ggg gtc tta tgg act gtt tac cac gga gct ggc aac aag aca 3619
Ile Ser Gly Val Leu Trp Thr Val Tyr His Gly Ala Gly Asn Lys Thr
1080 1085 1090

cta gcc ggc tcg cgg ggc ccg gtc acg cag atg tac tcg agc gcc gag 3667
Leu Ala Gly Ser Arg Gly Pro Val Thr Gln Met Tyr Ser Ser Ala Glu
1095 1100 1105

ggg gac ttg gtc ggg tgg ccc agc cct cct ggg acc aaa tct ttg gag 3715
Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu
1110 1115 1120 1125

ccg tgt acg tgt gga gcg gtc gac ctg tat ttg gtc acg cgg aac gct 3763
Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn Ala
1130 1135 1140

gat gtc atc ccg gct cga aga cgc ggg gac aag cgg gga gcg ctg ctc 3811
Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu
1145 1150 1155

tcc ccg aga ccc ctt tcg acc ttg aag ggg tcc tcg ggg gga cct gtg 3859
Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser Ser Gly Gly Pro Val
1160 1165 1170

ctt tgc cct agg ggc cac gct gtc gga atc ttc cgg gca gct gtg tgc 3907
Leu Cys Pro Arg Gly His Ala Val Gly Ile Phe Arg Ala Ala Val Cys
1175 1180 1185

tct cgg ggt gtg gct aag tcc ata gat ttc atc ccc gtt gag acg ctc 3955
Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile Pro Val Glu Thr Leu
1190 1195 1200 1205

gac atc gtc acg cgg tct ccc acc ttt agt gac aac agc aca cca cca 4003
Asp Ile Val Thr Arg Ser Pro Thr Phe Ser Asp Asn Ser Thr Pro Pro
1210 1215 1220

gct gtg ccc cag acc tat cag gtg ggg tac ttg cac gcc ccc act ggc 4051
Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu His Ala Pro Thr Gly
1225 1230 1235

agt gga aaa agc acc aag gtc ccc gtc gcg tac gcc gcc cag ggg tat 4099
Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr Ala Ala Gln Gly Tyr
1240 1245 1250

aaa gtg ctg gtg ctc aat ccc tcg gtg gct gcc acc ctg gga ttt ggg 4147
Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly
1255 1260 1265

gcg tac ttg tcc aag gca cat ggc atc aac ccc aac att agg act gga 4195
Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro Asn Ile Arg Thr Gly
1270 1275 1280 1285

gtc aga act gtg acg acc ggg gag ccc att aca tac tcc acg tat ggt 4243
Val Arg Thr Val Thr Thr Gly Glu Pro Ile Thr Tyr Ser Thr Tyr Gly
1290 1295 1300

aaa ttc ctc gcc gat ggg ggc tgc gca ggc ggc gcc tat gac atc atc 4291
Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly Ala Tyr Asp Ile Ile
1305 1310 1315

ata tgc gat gaa tgc cac tct gtg gat gct acc act att ctc ggc atc 4339
Ile Cys Asp Glu Cys His Ser Val Asp Ala Thr Thr Ile Leu Gly Ile
1320 1325 1330

ggg aca gtc ctt gac caa gca gag aca gcc ggg gtc agg cta act gta 4387
Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly Val Arg Leu Thr Val
1335 1340 1345

ctg gcc acg gcc acg ccc ccc ggg tcg gtg aca acc ccc cat ccc aat 4435
Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asn
1350 1355 1360 1365

ata gag gag gta gcc ctc gga cag gag ggt gag atc ccc ttc tat ggg 4483
Ile Glu Glu Val Ala Leu Gly Gln Glu Gly Glu Ile Pro Phe Tyr Gly
1370 1375 1380

agg gcg ttt ccc ctg tct tac atc aag gga ggg agg cac ttg att ttc 4531
Arg Ala Phe Pro Leu Ser Tyr Ile Lys Gly Gly Arg His Leu Ile Phe
1385 1390 1395

tgc cac tca aag aaa aag tgt gac gag ctc gca acg gcc ctt cgg ggc 4579
Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Thr Ala Leu Arg Gly
1400 1405 1410

atg ggc ttg aac gct gtg gca tat tac aga ggg ttg gac gtc tcc ata 4627
Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Ile
1415 1420 1425

ata cca act caa gga gat gtg gtg gtc gtt gcc acc gac gcc ctc atg 4675
Ile Pro Thr Gln Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met
1430 1435 1440 1445



acg ggg tat act gga gac ttt gac tcc gtg atc gac tgc aac gta gcg 4723
Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Val Ala
1450 1455 1460

gtc acc cag gcc gta gac ttc agc ctg gac ccc acc ttc act ata acc 4771
Val Thr Gln Ala Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Thr
1465 1470 1475

aca cag act gtc ccg caa gac gct gtc tca cgt agt cag cgc cga ggg 4819
Thr Gln Thr Val Pro Gln Asp Ala Val Ser Arg Ser Gln Arg Arg Gly
1480 1485 1490

cgc acg ggt aga gga aga ctg ggc att tat agg tat gtt tcc act ggt 4867
Arg Thr Gly Arg Gly Arg Leu Gly Ile Tyr Arg Tyr Val Ser Thr Gly
1495 1500 1505

gag cga gcc tca gga atg ttt gac agt gta gta ctc tgt gag tgc tac 4915
Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr
1510 1515 1520 1525

gac gca gga gct gct tgg tat gag ctc tca cca gtg gag acg acc gtc 4963
Asp Ala Gly Ala Ala Trp Tyr Glu Leu Ser Pro Val Glu Thr Thr Val
1530 1535 1540

agg ctc agg gcg tat ttc aac acg cct ggc ttg cct gtg tgc cag gac 5011
Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gln Asp
1545 1550 1555



cac ctt gag ttt tgg gag gca gtt ttc acc ggc ctc aca cac ata gac 5059
His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His Ile Asp
1560 1565 1570

gct cat ttc ctt tcc cag aca aag cag tcg ggg gaa aat ttc gca tac 5107
Ala His Phe Leu Ser Gln Thr Lys Gln Ser Gly Glu Asn Phe Ala Tyr
1575 1580 1585

tta gta gcc tat cag gcc aca gtg tgc gcc agg gcc aaa gcg ccc ccc 5155
Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro
1590 1595 1600 1605

ccg tcc tgg gac gtc atg tgg aag tgc ttg act cga ctc aag ccc acg 5203
Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr Arg Leu Lys Pro Thr
1610 1615 1620

ctt gtg ggc cct aca cct ctc ctg tac cgt ttg ggc tct gtt acc aac 5251
Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ser Val Thr Asn
1625 1630 1635

gag gtc acc ctt aca cac ccc gtg aca aaa tac atc gcc aca tgc atg 5299
Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr Ile Ala Thr Cys Met
1640 1645 1650

caa gct gac ctc gag gtc atg acc agc acg tgg gtc ctg gct ggg gga 5347
Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly Gly
1655 1660 1665

gtc tta gca gcc gtc gcc gcg tat tgc tta gcg acc ggg tgt gtt tcc 5395
Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys Val Ser
1670 1675 1680 1685

atc att ggc cgt tta cac atc aac cag cga gct gtc gtc gct ccg gac 5443
Ile Ile Gly Arg Leu His Ile Asn Gln Arg Ala Val Val Ala Pro Asp
1690 1695 1700

aag gag gtc ctc tat gag gct ttt gat gag atg gag gaa tgt gcc tcc 5491
Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Cys Ala Ser
1705 1710 1715

aga gcg gct ctc ctt gaa gag ggg cag cgg ata gcc gag atg ctg aag 5539
Arg Ala Ala Leu Leu Glu Glu Gly Gln Arg Ile Ala Glu Met Leu Lys
1720 1725 1730

tcc aag atc caa ggc tta ttg cag caa gcc tct aaa cag gcc cag gac 5587
Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser Lys Gln Ala Gln Asp
1735 1740 1745

ata caa ccc gct gtg caa gct tcg tgg ccc aag atg gag caa ttc tgg 5635
Ile Gln Pro Ala Val Gln Ala Ser Trp Pro Lys Met Glu Gln Phe Trp
1750 1755 1760 1765

gcc aaa cat atg tgg aac ttc ata agc ggc att cag tac ctc gca gga 5683
Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile Gln Tyr Leu Ala Gly
1770 1775 1780



ctg tca aca ctg cca ggg aac cct gct gtg gct tcc atg atg gca ttc 5731
Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe
1785 1790 1795

agc gcc gcc ctc acc agt ccg ttg tca act agc acc acc atc ctt ctt 5779
Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr Ile Leu Leu
1800 1805 1810

aac att ctg ggg ggc tgg ctg gcg tcc caa att gcg cca ccc gcg ggg 5827
Asn Ile Leu Gly Gly Trp Leu Ala Ser Gln Ile Ala Pro Pro Ala Gly
1815 1820 1825

gcc act ggc ttt gtt gtc agt ggc ctg gtg gga gct gct gtt ggc agc 5875
Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser
1830 1835 1840 1845

ata ggc ttg ggt aaa gtg ctg gtg gac atc ctg gca ggg tat ggt gcg 5923
Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu Ala Gly Tyr Gly Ala
1850 1855 1860

ggc att tcg ggg gcc ctc gtc gcg ttt aag atc atg tct ggc gag aag 5971
Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile Met Ser Gly Glu Lys
1865 1870 1875

ccc tcc atg gag gat gtc atc aac ttg ctg cct ggg att ctg tct cca 6019
Pro Ser Met Glu Asp Val Ile Asn Leu Leu Pro Gly Ile Leu Ser Pro
1880 1885 1890

ggt gct ctg gtg gtg gga gtc atc tgc gcg gcc att ctg cgc cgc cat 6067
Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala Ile Leu Arg Arg His
1895 1900 1905

gtg gga ccg ggg gaa ggc gcg gtc caa tgg atg aac agg ctt atc gcc 6115
Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met Asn Arg Leu Ile Ala
1910 1915 1920 1925

ttc gct tcc aga gga aac cac gtc gcc cct act cac tac gtg acg gag 6163
Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr Glu
1930 1935 1940

tcg gat gcg tcg cag cgt gtc acc caa ctg ctt ggc tct ctc act ata 6211
Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu Gly Ser Leu Thr Ile
1945 1950 1955

act agt cta ctc agg aga ctt cac aac tgg atc act gag gat tgc ccc 6259
Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile Thr Glu Asp Cys Pro
1960 1965 1970

atc cca tgc gcc ggc tcg tgg ctc cgc gat gtg tgg gac tgg gtc tgt 6307
Ile Pro Cys Ala Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Val Cys
1975 1980 1985

acc atc cta aca gac ttt aag aac tgg ctg acc tcc aag ctg ttc cca 6355
Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro
1990 1995 2000 2005

aag atg cct ggc ctc ccc ttt atc tct tgc caa aag ggg tac aag ggc 6403
Lys Met Pro Gly Leu Pro Phe Ile Ser Cys Gln Lys Gly Tyr Lys Gly
2010 2015 2020



gtg tgg gcc ggc act ggc atc atg acc aca cga tgc ccc tgc ggc gcc 6451
Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg Cys Pro Cys Gly Ala
2025 2030 2035

aac atc tct ggc aac gtc cgc ttg ggc tct atg aga atc aca gga ccc 6499
Asn Ile Ser Gly Asn Val Arg Leu Gly Ser Met Arg Ile Thr Gly Pro
2040 2045 2050

aaa acc tgc atg aac acc tgg cag ggg acc ttt cct atc aat tgt tat 6547
Lys Thr Cys Met Asn Thr Trp Gln Gly Thr Phe Pro Ile Asn Cys Tyr
2055 2060 2065

aca gaa ggc cag tgc ttg ccg aaa ccc gcg tta aac ttc aag acc gcc 6595
Thr Glu Gly Gln Cys Leu Pro Lys Pro Ala Leu Asn Phe Lys Thr Ala
2070 2075 2080 2085

atc tgg aga gtg gcg gcc tca gag tac gcg gaa gtg acg cag cac gga 6643
Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gln His Gly
2090 2095 2100

tca tat gcc tat ata aca ggg ctg acc act gac aac tta aaa gtc cct 6691
Ser Tyr Ala Tyr Ile Thr Gly Leu Thr Thr Asp Asn Leu Lys Val Pro
2105 2110 2115

tgc caa ctc ccc tct cca gag ttt ttc tct tgg gtg gac gga gta caa 6739
Cys Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gln
2120 2125 2130

atc cat agg tcc gcc ccc aca cca aag ccg ttt ttc cgg gat gag gtc 6787
Ile His Arg Ser Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu Val
2135 2140 2145

tcg ttc agc gtt ggg ctc aat tca ttt gtc gtc ggg tct cag ctt ccc 6835
Ser Phe Ser Val Gly Leu Asn Ser Phe Val Val Gly Ser Gln Leu Pro
2150 2155 2160 2165

tgt gac cct gag ccc gac act gag gta gtg atg tcc atg cta aca gac 6883
Cys Asp Pro Glu Pro Asp Thr Glu Val Val Met Ser Met Leu Thr Asp
2170 2175 2180

cca tcc cat atc acg gcg gag gct gca gcg cgg cgt tta gcg cgg ggg 6931
Pro Ser His Ile Thr Ala Glu Ala Ala Ala Arg Arg Leu Ala Arg Gly
2185 2190 2195

tca ccc cca tct gag gca agc tcc tca gcg agc cag ctg tcg gcg cca 6979
Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser Gln Leu Ser Ala Pro
2200 2205 2210

tcg ctg cga gcc acc tgc acc acc cac ggt agg acc tat gat gtg gac 7027
Ser Leu Arg Ala Thr Cys Thr Thr His Gly Arg Thr Tyr Asp Val Asp
2215 2220 2225

atg gtg gat gcc aac ctg ttc atg ggg ggc ggc gtg att cgg ata gag 7075
Met Val Asp Ala Asn Leu Phe Met Gly Gly Gly Val Ile Arg Ile Glu
2230 2235 2240 2245

tct gag tcc aaa gtg gtc gtt ctg gac tcc ctc gac tca atg acc gag 7123
Ser Glu Ser Lys Val Val Val Leu Asp Ser Leu Asp Ser Met Thr Glu
2250 2255 2260

gaa gag ggc gac ctt gag cct tca gta cca tcg gag tat atg ctc ccc 7171
Glu Glu Gly Asp Leu Glu Pro Ser Val Pro Ser Glu Tyr Met Leu Pro
2265 2270 2275

agg aag agg ttc cca ccg gcc tta ccg gct tgg gcg cgg cct gat tac 7219
Arg Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr
2280 2285 2290

aac cca ccg ctt gtg gaa tcg tgg aag agg cca gat tac caa cca ccc 7267
Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro Asp Tyr Gln Pro Pro
2295 2300 2305

act gtt gcg ggc tgt gct ctc ccc ccc ccc aaa aag acc ccg acg cct 7315
Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lys Thr Pro Thr Pro
2310 2315 2320 2325

cct cca agg aga cgc cgg aca gtg ggt ctg agc gag agc acc ata gga 7363
Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Thr Ile Gly
2330 2335 2340

gat gcc ctc caa cag ctg gcc atc aag tcc ttt ggc cag ccc ccc cca 7411
Asp Ala Leu Gln Gln Leu Ala Ile Lys Ser Phe Gly Gln Pro Pro Pro
2345 2350 2355

agc ggc gat tca ggc ctt tcc acg ggg gcg gac gcc gcc gac tcc ggc 7459
Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Asp Ala Ala Asp Ser Gly
2360 2365 2370



gat cgg aca ccc cct gac gag ttg gct ctt tcg gag aca ggt tct acc 7507
Asp Arg Thr Pro Pro Asp Glu Leu Ala Leu Ser Glu Thr Gly Ser Thr
2375 2380 2385

tcc tcc atg ccc ccc ctc gag ggg gag cct ggg gac cca gac ctg gag 7555
Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Glu
2390 2395 2400 2405

cct gag cag gta gag ctt caa cct cct ccc cag ggg ggg gag gca gct 7603
Pro Glu Gln Val Glu Leu Gln Pro Pro Pro Gln Gly Gly Glu Ala Ala
2410 2415 2420

ccc ggc tcg gac tcg ggg tcc tgg tct act tgc tcc gag gag gat gac 7651
Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys Ser Glu Glu Asp Asp
2425 2430 2435

tcc gtc gtg tgc tgc tcc atg tca tat tcc tgg acc ggg gct cta ata 7699
Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu Ile
2440 2445 2450

act cct tgt agc ccc gaa gag gaa aag ttg cca att aac tcc ttg agc 7747
Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro Ile Asn Ser Leu Ser
2455 2460 2465

aac tcg ctg ttg cga tac cat aac aag gta tac tgt act aca tca aag 7795
Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr Cys Thr Thr Ser Lys
2470 2475 2480 2485

agt gcc tca cta agg gct aaa aag gta act ttt gat agg atg caa gtg 7843
Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe Asp Arg Met Gln Val
2490 2495 2500



ctc gac gcc tat tat gat tca gtc tta aag gac atc aag cta gcg gcc 7891
Leu Asp Ala Tyr Tyr Asp Ser Val Leu Lys Asp Ile Lys Leu Ala Ala
2505 2510 2515

tcc aag gtc agc gca agg ctc ctc acc tta gag gag gcg tgc caa ttg 7939
Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gln Leu
2520 2525 2530

acc cca ccc cac tct gca aga tcc aag tat ggg ttt ggg gct aag gag 7987
Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu
2535 2540 2545

gtc cgc agc ttg tcc ggg agg gcc gtc aac cac atc aag tcc gtg tgg 8035
Val Arg Ser Leu Ser Gly Arg Ala Val Asn His Ile Lys Ser Val Trp
2550 2555 2560 2565

aag gac ctc ttg gaa gac tca caa aca cca att cct aca acc atc atg 8083
Lys Asp Leu Leu Glu Asp Ser Gln Thr Pro Ile Pro Thr Thr Ile Met
2570 2575 2580

gcc aaa aat gag gtg ttc tgc gtg gac ccc gcc aag ggg ggt aaa aaa 8131
Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala Lys Gly Gly Lys Lys
2585 2590 2595

cca get cgc ctt ate gtt tac cct gac etc ggc gtc agg gtc tgc gag 8179 

Pro Ala Arg Leu He Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu 
2600 2605 2610 

aag atg gcc ctt tat gat gtc aca caa aag ctt cct cag gcg gtg atg 8227
Lys Met Ala Leu Tyr Asp Val Thr Gln Lys Leu Pro Gln Ala Val Met
2615 2620 2625

ggg gct tct tat ggc ttc cag tac tcc ccc gct cag cgg gtg gag ttt 8275
Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala Gln Arg Val Glu Phe
2630 2635 2640 2645

ctc ttg aag gca tgg gcg gaa aag aga gac cct atg ggt ttt tcg tat 8323
Leu Leu Lys Ala Trp Ala Glu Lys Arg Asp Pro Met Gly Phe Ser Tyr
2650 2655 2660

gat acc cga tgc ttt gac tca acc gtc act gag aga gac atc agg act 8371
Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg Asp Ile Arg Thr
2665 2670 2675

gag gag tcc ata tac cag gcc tgc tcc tta ccc gag gag gcc cga act 8419
Glu Glu Ser Ile Tyr Gln Ala Cys Ser Leu Pro Glu Glu Ala Arg Thr
2680 2685 2690

gcc ata cac tcg ctg act gag aga ctc tat gtg gga ggg ccc atg ttc 8467
Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Phe
2695 2700 2705

aac agc aag ggc cag tcc tgc ggg tac agg cgt tgc cgc gcc agc ggg 8515
Asn Ser Lys Gly Gln Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly
2710 2715 2720 2725 

gtg ctt acc act agt atg ggg aac acc atc aca tgc tat gta aaa gcc 8563
Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr Cys Tyr Val Lys Ala
2730 2735 2740

cta gcg gct tgc aag gct gcg ggg ata att gcg ccc acg atg ctg gta 8611
Leu Ala Ala Cys Lys Ala Ala Gly Ile Ile Ala Pro Thr Met Leu Val
2745 2750 2755

tgc ggc gac gac ttg gtc gtc atc tca gaa agc cag ggg act gag gag 8659
Cys Gly Asp Asp Leu Val Val Ile Ser Glu Ser Gln Gly Thr Glu Glu
2760 2765 2770

gac gag cgg aac ctg aga gcc ttc acg gag gct atg acc agg tat tct 8707
Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser
2775 2780 2785

gcc cct cct ggt gac ccc ccc aga ccg gaa tat gac ctg gag cta ata 8755
Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu Ile
2790 2795 2800 2805

aca tct tgt tcc tca aac gtg tct gtg gca ctt ggc cca cag ggc cgc 8803
Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu Gly Pro Gln Gly Arg
2810 2815 2820

cgc aga tac tac ctg acc aga gac ccc acc act tca att gcc cgg gct 8851
Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Ser Ile Ala Arg Ala
2825 2830 2835 



gcc tgg gaa aca gtt aga cac tcc cct gtc aat tca tgg ctg gga aac 8899
Ala Trp Glu Thr Val Arg His Ser Pro Val Asn Ser Trp Leu Gly Asn
2840 2845 2850 

atc atc cag tac gct cca acc ata tgg gtt cgc atg gtc ctg atg aca 8947
Ile Ile Gln Tyr Ala Pro Thr Ile Trp Val Arg Met Val Leu Met Thr
2855 2860 2865

cac ttc ttc tcc att ctc atg gcc cag gac acc cta gac cag aac ctt 8995
His Phe Phe Ser Ile Leu Met Ala Gln Asp Thr Leu Asp Gln Asn Leu
2870 2875 2880 2885

aac ttt gaa atg tac gga tcg gtg tac tcc gtg agt cct ctg gac ctc 9043
Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val Ser Pro Leu Asp Leu
2890 2895 2900

cca gcc ata att gaa agg tta cac ggg ctt gac gcc ttc tct ctg cac 9091
Pro Ala Ile Ile Glu Arg Leu His Gly Leu Asp Ala Phe Ser Leu His
2905 2910 2915

aca tac act ccc cac gaa ctg acg cgg gtg gct tca gcc ctc aga aaa 9139
Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala Ser Ala Leu Arg Lys
2920 2925 2930

ctt ggg gcg cca ccc ctc aga gcg tgg aag agt cgg gcg cgt gca gtt 9187
Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser Arg Ala Arg Ala Val
2935 2940 2945

agg gcg tcc ctc atc tcc cgt ggg ggg agg gcg gcc gtt tgc ggt cgg 9235
Arg Ala Ser Leu Ile Ser Arg Gly Gly Arg Ala Ala Val Cys Gly Arg
2950 2955 2960 2965

tac ctc ttc aac tgg gcg gtg aag acc aag ctc aaa ctc act cct ttg 9283
Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu
2970 2975 2980

ccg gag gca cgc ctc ctg gat ttg tcc agt tgg ttt acc gtc ggc gcc 9331
Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp Phe Thr Val Gly Ala
2985 2990 2995

ggc ggg ggc gac att tat cac agc gtg tcg cgt gcc cga ccc cgc cta 9379
Gly Gly Gly Asp Ile Tyr His Ser Val Ser Arg Ala Arg Pro Arg Leu
3000 3005 3010

tta ctc ctt agc cta ctc cta ctt tct gta ggg gta ggc ctc ttc cta 9427
Leu Leu Leu Ser Leu Leu Leu Leu Ser Val Gly Val Gly Leu Phe Leu
3015 3020 3025

ctc ccc gct cga tag agcggcacac attagctaca ctccatagct aactgttcct 9482
Leu Pro Ala Arg
3030

tttttttttt tttttttttt tttttttttt tttttttctt tttttttttt tttccctctt 9542 

tcttcccttc tcatcttatt ctactttctt tcttggtggc tccatcttag ccctggtcac 9602 

ggctagctgt gaaaggtccg tgagccgcat gactgcagag agtgccgtaa ctggtctctc 9662

tgcagatcat gt 9674



<210> 6 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus 
<400> 6 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn

1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 

35 40 45 

Thr Arg Lys Ala Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro

50 55 60

Ile Pro Lys His Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 

85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 

100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys

115 120 125 

Gly Phe Ala Asp Leu Leu Gly Tyr Val Pro Val Val Gly Ala Pro Leu 

130 135 140 

Ser Gly Val Ala Ser Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr Thr Pro Val Ser Ala Val

180 185 190

Gln Val Lys Asn Thr Ser Asn Ala Tyr Met Ala Thr Asn Asp Cys Ser

195 200 205

Asn Asp Ser Ile Thr Trp Gln Leu Glu Ala Ala Val Leu His Val Pro

210 215 220

Gly Cys Val Pro Cys Glu Lys Met Gly Asn Thr Ser Arg Cys Trp Ile
225 230 235 240

Pro Val Ser Pro Asn Val Ala Val Arg Gln Pro Gly Ala Leu Thr Arg

245 250 255

Gly Leu Arg Thr His Ile Asp Met Val Val Leu Ser Ala Thr Leu Cys

260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ser 

275 280 285 

Gln Met Phe Ile Val Ser Pro Gln His His Trp Phe Val Gln Glu Cys

290 295 300

Asn Cys Ser Ile Tyr Pro Gly Ala Ile Thr Gly His Arg Met Ala Trp
305 310 315 320

Asp Met Met Met Asn Trp Ser Pro Thr Thr Thr Met Ile Leu Ala Tyr

325 330 335

Val Met Arg Val Pro Glu Val Ile Ile Asp Ile Ile Ser Gly Ala His

340 345 350

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp

355 360 365

Ala Lys Val Val Val Ile Leu Leu Leu Ala Ser Gly Val Asp Ala Tyr

370 375 380

Thr Thr Thr Thr Gly Ser Ala Ala Gly Arg Thr Thr Ser Ser Leu Ala
385 390 395 400

Ser Ala Phe Ser Pro Gly Ala Arg Gln Asn Ile Gln Leu Ile Asn Thr

405 410 415

Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser

420 425 430

Leu His Thr Gly Phe Phe Thr Ala Leu Phe Tyr Ile His Lys Phe Asn

435 440 445

Ser Ser Gly Cys Pro Glu Arg Leu Ser Ala Cys Arg Asn Ile Glu Asp

450 455 460

Phe Arg Ile Gly Trp Gly Ala Leu Gln Tyr Asp Asp Asn Val Thr Asn
465 470 475 480

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Lys Gln Cys

485 490 495 

Gly Val Val Pro Ala Gly Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 

500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Val Pro Thr 

515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 

530 535 540 

Arg Pro Pro Ser Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr 
545 550 555 560 

Gly Phe Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 

565 570 575 

Phe Asn Thr Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 

580 585 590 

His Pro Glu Ala Thr Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr

595 600 605 

Pro Lys Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 

610 615 620 

Thr Val Asn Tyr Ser Thr Phe Lys Ile Arg Met Tyr Val Gly Gly Val
625 630 635 640

Glu His Arg Leu Met Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 

645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gln Gln Thr Pro Leu Leu His Ser

660 665 670

Thr Thr Glu Trp Ala Ile Leu Pro Cys Ser Phe Ser Asp Leu Pro Ala

675 680 685

Leu Ser Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln

690 695 700

Tyr Met Tyr Gly Leu Ser Pro Ala Leu Thr Gln Tyr Ile Val Arg Trp
705 710 715 720 

Glu Trp Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 

725 730 735 

Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu

740 745 750 

Glu Lys Leu Val Val Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly 

755 760 765 

Phe Leu Tyr Phe Val Ile Phe Leu Val Ala Ala Trp His Ile Lys Gly

770 775 780 

Arg Val Val Pro Leu Ala Ala Tyr Ser Leu Thr Gly Leu Trp Pro Phe 
785 790 795 800 

Cys Leu Leu Leu Leu Ala Leu Pro Gln Gln Ala Tyr Ala Tyr Asp Ala

805 810 815

Ser Val His Gly Gln Val Gly Ala Ala Leu Leu Val Leu Ile Thr Leu

820 825 830

Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Ser Gln Ser Leu Trp

835 840 845

Trp Leu Cys Tyr Leu Leu Thr Leu Ala Glu Thr Met Val Gln Glu Trp
850 855 860

Ala Pro Ser Met Gln Ala Arg Gly Gly Arg Asp Gly Ile Ile Trp Ala
865 870 875 880

Ala Thr Ile Phe Cys Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu

885 890 895 

Leu Ala Val Leu Gly Pro Gly Tyr Leu Leu Arg Gly Ala Leu Thr Arg 

900 905 910 

Val Pro Tyr Phe Val Arg Ala His Ala Leu Leu Arg Met Cys Thr Met 

915 920 925 

Val Arg His Leu Ala Gly Gly Arg Tyr Val Gln Met Ala Leu Leu Ala

930 935 940

Leu Gly Arg Trp Thr Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met
945 950 955 960 

Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu 

965 970 975 

Pro Ile Ile Phe Ser Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala

980 985 990

Glu Thr Ala Ala Cys Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala

995 1000 1005

Arg Leu Gly Arg Glu Ile Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser

1010 1015 1020

Lys Gly Trp Lys Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr
1025 1030 1035 1040

Arg Gly Leu Leu Gly Ser Ile Val Val Ser Met Thr Gly Arg Asp Lys

1045 1050 1055

Thr Glu Gln Ala Gly Glu Val Gln Val Leu Ser Thr Val Thr Gln Ser

1060 1065 1070

Phe Leu Gly Thr Ser Ile Ser Gly Val Leu Trp Thr Val Tyr His Gly

1075 1080 1085

Ala Gly Asn Lys Thr Leu Ala Gly Ser Arg Gly Pro Val Thr Gln Met
1090 1095 1100

Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly 
1105 1110 1115 1120 

Thr Lys Ser Leu Glu Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu 

1125 1130 1135 

Val Thr Arg Asn Ala Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys

1140 1145 1150 

Arg Gly Ala Leu Leu Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser 

1155 1160 1165 

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Ala Val Gly Ile Phe

1170 1175 1180

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile
1185 1190 1195 1200

Pro Val Glu Thr Leu Asp Ile Val Thr Arg Ser Pro Thr Phe Ser Asp

1205 1210 1215

Asn Ser Thr Pro Pro Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu

1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr 

1235 1240 1245 

Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala

1250 1255 1260

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro
1265 1270 1275 1280

Asn Ile Arg Thr Gly Val Arg Thr Val Thr Thr Gly Glu Pro Ile Thr

1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly 

1300 1305 1310 

Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Val Asp Ala Thr
1315 1320 1325

Thr Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly

1330 1335 1340 

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Thr Pro His Pro Asn Ile Glu Glu Val Ala Leu Gly Gln Glu Gly Glu

1365 1370 1375

Ile Pro Phe Tyr Gly Arg Ala Phe Pro Leu Ser Tyr Ile Lys Gly Gly

1380 1385 1390

Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala

1395 1400 1405 

Thr Ala Leu Arg Gly Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly 

1410 1415 1420 

Leu Asp Val Ser Ile Ile Pro Thr Gln Gly Asp Val Val Val Val Ala
1425 1430 1435 1440

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile

1445 1450 1455

Asp Cys Asn Val Ala Val Thr Gln Ala Val Asp Phe Ser Leu Asp Pro

1460 1465 1470

Thr Phe Thr Ile Thr Thr Gln Thr Val Pro Gln Asp Ala Val Ser Arg

1475 1480 1485

Ser Gln Arg Arg Gly Arg Thr Gly Arg Gly Arg Leu Gly Ile Tyr Arg

1490 1495 1500 

Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Glu Leu Ser Pro 

1525 1530 1535 

Val Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 

1540 1545 1550 

Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly
1555 1560 1565

Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln Ser Gly

1570 1575 1580

Glu Asn Phe Ala Tyr Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg
1585 1590 1595 1600 

Ala Lys Ala Pro Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr 

1605 1610 1615 

Arg Leu Lys Pro Thr Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu 

1620 1625 1630 

Gly Ser Val Thr Asn Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr 

1635 1640 1645 

Ile Ala Thr Cys Met Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp

1650 1655 1660 

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys Val Ser Ile Ile Gly Arg Leu His Ile Asn Gln Arg Ala

1685 1690 1695 

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met 

1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu Leu Glu Glu Gly Gln Arg Ile

1715 1720 1725

Ala Glu Met Leu Lys Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser

1730 1735 1740

Lys Gln Ala Gln Asp Ile Gln Pro Ala Val Gln Ala Ser Trp Pro Lys
1745 1750 1755 1760

Met Glu Gln Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile

1765 1770 1775

Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala
1780 1785 1790

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser

1795 1800 1805 

Thr Thr Ile Leu Leu Asn Ile Leu Gly Gly Trp Leu Ala Ser Gln Ile

1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly 
1825 1830 1835 1840 

Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu

1845 1850 1855

Ala Gly Tyr Gly Ala Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile

1860 1865 1870

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val Ile Asn Leu Leu Pro

1875 1880 1885

Gly Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala

1890 1895 1900

Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met
1905 1910 1915 1920

Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr

1925 1930 1935

His Tyr Val Thr Glu Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu

1940 1945 1950

Gly Ser Leu Thr Ile Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile

1955 1960 1965

Thr Glu Asp Cys Pro Ile Pro Cys Ala Gly Ser Trp Leu Arg Asp Val

1970 1975 1980

Trp Asp Trp Val Cys Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr
1985 1990 1995 2000

Ser Lys Leu Phe Pro Lys Met Pro Gly Leu Pro Phe Ile Ser Cys Gln

2005 2010 2015

Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg
2020 2025 2030

Cys Pro Cys Gly Ala Asn Ile Ser Gly Asn Val Arg Leu Gly Ser Met

2035 2040 2045

Arg Ile Thr Gly Pro Lys Thr Cys Met Asn Thr Trp Gln Gly Thr Phe

2050 2055 2060

Pro Ile Asn Cys Tyr Thr Glu Gly Gln Cys Leu Pro Lys Pro Ala Leu
2065 2070 2075 2080

Asn Phe Lys Thr Ala Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu

2085 2090 2095

Val Thr Gln His Gly Ser Tyr Ala Tyr Ile Thr Gly Leu Thr Thr Asp

2100 2105 2110

Asn Leu Lys Val Pro Cys Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp

2115 2120 2125

Val Asp Gly Val Gln Ile His Arg Ser Ala Pro Thr Pro Lys Pro Phe

2130 2135 2140 

Phe Arg Asp Glu Val Ser Phe Ser Val Gly Leu Asn Ser Phe Val Val 
2145 2150 2155 2160 

Gly Ser Gln Leu Pro Cys Asp Pro Glu Pro Asp Thr Glu Val Val Met

2165 2170 2175

Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala Glu Ala Ala Ala Arg

2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser 

2195 2200 2205 

Gln Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Gly Arg

2210 2215 2220 

Thr Tyr Asp Val Asp Met Val Asp Ala Asn Leu Phe Met Gly Gly Gly 
2225 2230 2235 2240 

Val Ile Arg Ile Glu Ser Glu Ser Lys Val Val Val Leu Asp Ser Leu
2245 2250 2255

Asp Ser Met Thr Glu Glu Glu Gly Asp Leu Glu Pro Ser Val Pro Ser

2260 2265 2270 

Glu Tyr Met Leu Pro Arg Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro 

2290 2295 2300 

Asp Tyr Gln Pro Pro Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys
2305 2310 2315 2320 

Lys Thr Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser 

2325 2330 2335 

Glu Ser Thr Ile Gly Asp Ala Leu Gln Gln Leu Ala Ile Lys Ser Phe

2340 2345 2350

Gly Gln Pro Pro Pro Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Asp

2355 2360 2365 

Ala Ala Asp Ser Gly Asp Arg Thr Pro Pro Asp Glu Leu Ala Leu Ser 

2370 2375 2380 

Glu Thr Gly Ser Thr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Glu Pro Glu Gln Val Glu Leu Gln Pro Pro Pro Gln

2405 2410 2415 

Gly Gly Glu Ala Ala Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys 

2420 2425 2430 

Ser Glu Glu Asp Asp Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp 

2435 2440 2445 

Thr Gly Ala Leu Ile Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro

2450 2455 2460

Ile Asn Ser Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr
2465 2470 2475 2480

Cys Thr Thr Ser Lys Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe
2485 2490 2495

Asp Arg Met Gln Val Leu Asp Ala Tyr Tyr Asp Ser Val Leu Lys Asp

2500 2505 2510

Ile Lys Leu Ala Ala Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu

2515 2520 2525 

Glu Ala Cys Gln Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly

2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 2550 2555 2560 

Ile Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Gln Thr Pro Ile

2565 2570 2575

Pro Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala

2580 2585 2590

Lys Gly Gly Lys Lys Pro Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly

2595 2600 2605

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Thr Gln Lys Leu

2610 2615 2620

Pro Gln Ala Val Met Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala
2625 2630 2635 2640

Gln Arg Val Glu Phe Leu Leu Lys Ala Trp Ala Glu Lys Arg Asp Pro

2645 2650 2655 

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 

2660 2665 2670 

Arg Asp Ile Arg Thr Glu Glu Ser Ile Tyr Gln Ala Cys Ser Leu Pro

2675 2680 2685

Glu Glu Ala Arg Thr Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val

2690 2695 2700

Gly Gly Pro Met Phe Asn Ser Lys Gly Gln Ser Cys Gly Tyr Arg Arg
2705 2710 2715 2720

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr

2725 2730 2735

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly Ile Ile Ala

2740 2745 2750

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Ser Glu Ser

2755 2760 2765

Gln Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala

2770 2775 2780 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 
2785 2790 2795 2800 

Asp Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu

2805 2810 2815

Gly Pro Gln Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr

2820 2825 2830

Ser Ile Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Val Asn

2835 2840 2845

Ser Trp Leu Gly Asn Ile Ile Gln Tyr Ala Pro Thr Ile Trp Val Arg

2850 2855 2860

Met Val Leu Met Thr His Phe Phe Ser Ile Leu Met Ala Gln Asp Thr
2865 2870 2875 2880

Leu Asp Gln Asn Leu Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val

2885 2890 2895

Ser Pro Leu Asp Leu Pro Ala Ile Ile Glu Arg Leu His Gly Leu Asp

2900 2905 2910 

Ala Phe Ser Leu His Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala 

2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser 

2930 2935 2940 

Arg Ala Arg Ala Val Arg Ala Ser Leu Ile Ser Arg Gly Gly Arg Ala
2945 2950 2955 2960

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu 

2965 2970 2975 

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 

2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp Ile Tyr His Ser Val Ser Arg

2995 3000 3005 

Ala Arg Pro Arg Leu Leu Leu Leu Ser Leu Leu Leu Leu Ser Val Gly 

3010 3015 3020 

Val Gly Leu Phe Leu Leu Pro Ala Arg 
3025 3030 



<210> 7 
<211> 8024 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
<400> 7 

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
cgcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa auccuaaacc 360 
ucaaagaaaa accaaaagaa acaccaaccg ucgcccaaug auugaacaag auggauugca 420
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 540 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 840 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200 
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260 
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgaugau accauggcuc ccaucacugc uuaugcccag 1800 
caaacacgag gccuccuggg cgccauagug gugaguauga cggggcguga caggacagaa 1860 
caggccgggg aaguccaaau ccuguccaca gucucucagu ccuuccucgg aacaaccauc 1920 
ucggggguuu uguggacugu uuaccacgga gcuggcaaca agacucuagc cggcuuacgg 1980 
gguccgguca cgcagaugua cucgagugcu gagggggacu ugguaggcug gcccagcccc 2040 
ccugggacca agucuuugga gccgugcaag uguggagccg ucgaccuaua ucuggucacg 2100 
cggaacgcug augucauccc ggcucggaga cgcggggaca agcggggagc auugcucucc 2160
ccgagaccca uuucgaccuu gaaggggucc ucgggggggc cggugcucug cccuaggggc 2220 
cacgucguug ggcucuuccg agcagcugug ugcucucggg gcguggccaa auccaucgau 2280 
uucauccccg uugagacacu cgacguuguu acaaggucuc ccacuuucag ugacaacagc 2340 
acgccaccgg cugugcccca gaccuaucag gucggguacu ugcaugcucc aacuggcagu 2400 
ggaaagagca ccaagguccc ugucgcguau gccgcccagg gguacaaagu acuagugcuu 2460 
aaccccucgg uagcugccac ccugggguuu ggggcguacc uauccaaggc acauggcauc 2520 
aaucccaaca uuaggacugg agucaggacc gugaugaccg gggaggccau cacguacucc 2580 
acauauggca aauuucucgc cgaugggggc ugcgcuagcg gcgccuauga caucaucaua 2640 
ugcgaugaau gccacgcugu ggaugcuacc uccauucucg gcaucggaac gguccuugau 2700 
caagcagaga cagccggggu cagacuaacu gugcuggcua cggccacacc ccccggguca 2760 
gugacaaccc cccaucccga uauagaaaag guaggccucg ggcgggaggg ugagaucccc 2820 
uucuauggga gggcgauucc ccuauccugc aucaagggag ggagacaccu gauuuucugc 2880 
cacucaaaga aaaaguguga cgagcucgcg gcggcccuuc ggggcauggg cuugaaugcc 2940 
guggcauacu auagaggguu ggacgucucc auaauaccag cucagggaga uguggugguc 3000 
gucgccaccg acgcccucau gacgggguac acuggagacu uugacuccgu gaucgacugc 3060 
aauguagcgg ucacccaagc ugucgacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cacaagacgc ugucucacgc agucagcgcc gcgggcgcac agguagagga 3180 
agacagggca cuuauaggua uguuuccacu ggugaacgag ccucaggaau guuugacagu 3240 
guagugcuuu gugagugcua cgacgcaggg gcugcguggu acgaucucac accagcggag 3300 
accaccguca ggcuuagagc guauuucaac acgcccggcc uacccgugug ucaagaccau 3360 
cuugaauuuu gggaggcagu uuucaccggc cucacacaca uagacgccca cuuccucucc 3420 
caaacaaagc aagcggggga gaacuucgcg uaccuaguag ccuaccaagc uacggugugc 3480 
gccagagcca aggccccucc cccguccugg gacgccaugu ggaagugccu ggcccgacuc 3540 
aagccuacgc uugcgggccc cacaccucuc cuguaccguu ugggcccuau uaccaaugag 3600 
gucacccuca cacacccugg gacgaaguac aucgccacau gcaugcaagc ugaccuugag 3660 
gucaugacca gcacgugggu ccuagcugga ggaguccugg cagccgucgc cgcauauugc 3720 
cuggcgacug gaugcguuuc caucaucggc cgcuugcacg ucaaccagcg agucgucguu 3780 
gcgccggaua aggagguccu guaugaggcu uuugaugaga uggaggaaug cgccucuagg 3840 
gcggcucuca ucgaagaggg gcagcggaua gccgagaugu ugaaguccaa gauccaaggc 3900
uugcugcagc aggccucuaa gcaggcccag gacauacaac ccgcuaugca ggcuucaugg 3960 
cccaaagugg aacaauuuug ggccagacac auguggaacu ucauuagcgg cauccaauac 4020 
cucgcaggau ugucaacacu gccagggaac cccgcggugg cuuccaugau ggcauucagu 4080 
gccgcccuca ccaguccguu gucgaccagu accaccaucc uucucaacau caugggaggc 4140 
ugguuagcgu cccagaucgc accacccgcg ggggccaccg gcuuugucgu caguggccug 4200 
gugggggcug ccgugggcag cauaggccug gguaaggugc ugguggacau ccuggcagga 4260 
uauggugcgg gcauuucggg ggcccucguc gcauucaaga ucaugucugg cgagaagccc 4320 
ucuauggaag augucaucaa ucuacugccu gggauccugu cuccgggagc ccugguggug 4380 
ggggucaucu gcgcggccau ucugcgccgc cacgugggac cgggggaggg cgcgguccaa 4440 
uggaugaaca ggcuuauugc cuuugcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gugacggagu cggaugcguc gcagcgugug acccaacuac uuggcucucu uacuauaacc 4560 
agccuacuca gaagacucca caauuggaua acugaggacu gccccauccc augcuccgga 4620 
uccuggcucc gcgacgugug ggacuggguu ugcaccaucu ugacagacuu caaaaauugg 4680 
cugaccucua aauuguuccc caagcugccc ggccuccccu ucaucucuug ucaaaagggg 4740 
uacaagggug ugugggccgg cacuggcauc augaccacgc gcugcccuug cggcgccaac 4800 
aucucuggca auguccgccu gggcucuaug aggaucacag ggccuaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugc uacacggagg gccagugcgc gccgaaaccc 4920 
cccacgaacu acaagaccgc caucuggagg guggcggccu cggaguacgc ggaggugacg 4980 
cagcaugggu cguacuccua uguaacagga cugaccacug acaaucugaa aauuccuugc 5040 
caacuaccuu cuccagaguu uuucuccugg guggacggug ugcagaucca uagguuugca 5100 
cccacaccaa agccguuuuu ccgggaugag gucucguucu gcguugggcu uaauuccuau 5160 
gcugucgggu cccagcuucc cugugaaccu gagcccgacg cagacguauu gagguccaug 5220 
cuaacagauc cgccccacau cacggcggag acugcggcgc ggcgcuuggc acggggauca 5280 
ccuccaucug aggcgagcuc cucagugagc cagcuaucag caccgucgcu gcgggccacc 5340 
ugcaccaccc acagcaacac cuaugacgug gacauggucg augccaaccu gcucauggag 5400 
ggcggugugg cucagacaga gccugagucc agggugcccg uucuggacuu ucucgagcca 5460 
auggccgagg aagagagcga ccuugagccc ucaauaccau cggagugcau gcuccccagg 5520 
agcggguuuc cacgggccuu accggcuugg gcacggccug acuacaaccc gccgcucgug 5580 
gaaucgugga ggaggccaga uuaccaaccg cccaccguug cugguugugc ucuccccccc 5640
cccaagaagg ccccgacgcc ucccccaagg agacgccgga cagugggucu gagcgagagc 5700 
accauaucag aagcccucca gcaacuggcc aucaagaccu uuggccagcc ccccucgagc 5760 
ggugaugcag gcucguccac gggggcgggc gccgccgaau ccggcggucc gacguccccu 5820 
ggugagccgg cccccucaga gacagguucc gccuccucua ugcccccccu cgagggggag 5880 
ccuggagauc cggaccugga gucugaucag guagagcuuc aaccuccccc ccaggggggg 5940 
gggguagcuc ccgguucggg cucggggucu uggucuacuu gcuccgagga ggacgauacc 6000 
accgugugcu gcuccauguc auacuccugg accggggcuc uaauaacucc cuguagcccc 6060 
gaagaggaaa aguugccaau caacccuuug aguaacucgc uguugcgaua ccauaacaag 6120 
guguacugua caacaucaaa gagcgccuca cagagggcua aaaagguaac uuuugacagg 6180 
acgcaagugc ucgacgccca uuaugacuca gucuuaaagg acaucaagcu agcggcuucc 6240 
aaggucagcg caaggcuccu caccuuggag gaggcgugcc aguugacucc accccauucu 6300 
gcaagaucca aguauggauu cggggccaag gagguccgca gcuuguccgg gagggccguu 6360 
aaccacauca aguccgugug gaaggaccuc cuggaagacc cacaaacacc aauucccaca 6420 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaagaaacca 6480 
gcucgccuca ucguuuaccc ugaccucggc guccgggucu gcgagaaaau ggcccucuau 6540 
gacauuacac aaaagcuucc ucaggcggua augggagcuu ccuauggcuu ccaguacucc 6600 
ccugcccaac ggguggagua ucucuugaaa gcaugggcgg aaaagaagga ccccaugggu 6660 
uuuucguaug auacccgaug cuucgacuca accgucacug agagagacau caggaccgag 6720 
gaguccauau accaggccug cucccugccc gaggaggccc gcacugccau acacucgcug 6780 
acugagagac uuuacguagg agggcccaug uucaacagca agggucaaac cugcgguuac 6840 
agacguugcc gcgccagcgg ggugcuaacc acuagcaugg guaacaccau cacaugcuau 6900 
gugaaagccc uagcggccug caaggcugcg gggauaguug cgcccacaau gcugguaugc 6960 
ggcaaugacc uaguagucau cucagaaagc caggggacug aggaggacga gcggaaccug 7020 
agagccuuca cggaggccau gaccagguac ucugccccuc cuggugaucc ccccagaccg 7080 
gaauaugacc uggagcuaau aacauccugu uccucaaaug ugucuguggc guugggcccg 7140 
cggggccgcc gcagauacua ccugaccaga gacccaacca cuccacucgc ccgggcugcc 7200 
ugggaaacag uuagacacuc cccuaucaau ucauggcugg gaaacaucau ccaguaugcu 7260 
ccaaccauau ggguucgcau gguccuaaug acacacuucu ucuccauucu caugguccaa 7320 
gacacccugg accagaaccu caacuuugag auguauggau caguauacuc cgugaauccu 7380
uuggaccuuc cagccauaau ugagagguua cacgggcuug acgccuuuuc uaugcacaca 7440
uacucucacc acgaacugac gcggguggcu ucagcccuca gaaaacuugg ggcgccaccc 7500
cucagggugu ggaagagucg ggcucgcgca gucagggcgu cccucaucuc ccguggaggg 7560
aaagcggccg uuugcggccg auaucucuuc aauugggcgg ugaagaccaa gcucaaacuc 7620
acuccauugc cggaggcgcg ccuacuggac uuauccaguu gguucaccgu cggcgccggc 7680
gggggcgaca uuuuucacag cgugucgcgc gcccgacccc gcucauuacu cuucggccua 7740
cuccuacuuu ucguaggggu aggccucuuc cuacuccccg cucgguagag cggcacacac 7800
UdggUdbdLU LbdUdglUdd cuguuccuuu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu 7860
uuuuuuuuuu cuuuuuuuuu uuuuucccuc uuucuucccu ucucaucuua uucuacuuuc 7920
uuucuuggug gcuccaucuu agcccuaguc acggcuagcu gugaaagguc cgugagccgc 7980
augacugcag agagugccgu aacuggucuc ucugcagauc augu 8024



<210> 8 
<211> 7994 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
<400> 8 

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
cgcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa auccuaaacc 360 
ucaaagaaaa accaaaagaa acaccaaccg ucgcccaaug auugaacaag auggauugca 420
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 540 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 840 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200 
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260 
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgaugau accauggcuc ccaucacugc uuaugcccag 1800 
caaacacgag gccuccuggg cgccauagug gugaguauga cggggcguga caggacagaa 1860 
caggccgggg aaguccaaau ccuguccaca gucucucagu ccuuccucgg aacaaccauc 1920 
ucggggguuu uguggacugu uuaccacgga gcuggcaaca agacucuagc cggcuuacgg 1980 
gguccgguca cgcagaugua cucgagugcu gagggggacu ugguaggcug gcccagcccc 2040 
ccugggacca agucuuugga gccgugcaag uguggagccg ucgaccuaua ucuggucacg 2100 
cggaacgcug augucauccc ggcucggaga cgcggggaca agcggggagc auugcucucc 2160
ccgagaccca uuucgaccuu gaaggggucc ucgggggggc cggugcucug cccuaggggc 2220 
cacgucguug ggcucuuccg agcagcugug ugcucucggg gcguggccaa auccaucgau 2280 
uucauccccg uugagacacu cgacguuguu acaaggucuc ccacuuucag ugacaacagc 2340 
acgccaccgg cugugcccca gaccuaucag gucggguacu ugcaugcucc aacuggcagu 2400 
ggaaagagca ccaagguccc ugucgcguau gccgcccagg gguacaaagu acuagugcuu 2460 
aaccccucgg uagcugccac ccugggguuu ggggcguacc uauccaaggc acauggcauc 2520 
aaucccaaca uuaggacugg agucaggacc gugaugaccg gggaggccau cacguacucc 2580 
acauauggca aauuucucgc cgaugggggc ugcgcuagcg gcgccuauga caucaucaua 2640 
ugcgaugaau gccacgcugu ggaugcuacc uccauucucg gcaucggaac gguccuugau 2700 
caagcagaga cagccggggu cagacuaacu gugcuggcua cggccacacc ccccggguca 2760 
gugacaaccc cccaucccga uauagaagag guaggccucg ggcgggaggg ugagaucccc 2820 
uucuauggga gggcgauucc ccuauccugc aucaagggag ggagacaccu gauuuucugc 2880 
cacucaaaga aaaaguguga cgagcucgcg gcggcccuuc ggggcauggg cuugaaugcc 2940 
guggcauacu auagaggguu ggacgucucc auaauaccag cucagggaga uguggugguc 3000 
gucgccaccg acgcccucau gacgggguac acuggagacu uugacuccgu gaucgacugc 3060 
aauguagcgg ucacccaagc ugucgacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cacaagacgc ugucucacgc agucagcgcc gcgggcgcac agguagagga 3180 
agacagggca cuuauaggua uguuuccacu ggugaacgag ccucaggaau guuugacagu 3240 
guagugcuuu gugagugcua cgacgcaggg gcugcguggu acgaucucac accagcggag 3300 
accaccguca ggcuuagagc guauuucaac acgcccggcc uacccgugug ucaagaccau 3360 
cuugaauuuu gggaggcagu uuucaccggc cucacacaca uagacgccca cuuccucucc 3420 
caaacaaagc aagcggggga gaacuucgcg uaccuaguag ccuaccaagc uacggugugc 3480 
gccagagcca aggccccucc cccguccugg gacgccaugu ggaagugccu ggcccgacuc 3540 
aagccuacgc uugcgggccc cacaccucuc cuguaccguu ugggcccuau uaccaaugag 3600 
gucacccuca cacacccugg gacgaaguac aucgccacau gcaugcaagc ugaccuugag 3660 
gucaugacca gcacgugggu ccuagcugga ggaguccugg cagccgucgc cgcauauugc 3720 
cuggcgacug gaugcguuuc caucaucggc cgcuugcacg ucaaccagcg agucgucguu 3780 
gcgccggaua aggagguccu guaugaggcu uuugaugaga uggaggaaug cgccucuagg 3840 
gcggcucuca ucgaagaggg gcagcggaua gccgagaugu ugaaguccaa gauccaaggc 3900

uugcugcagc aggccucuaa gcaggcccag gacauacaac ccgcuaugca ggcuucaugg 3960 

cccaaagugg aacaauuuug ggccagacac auguggaacu ucauuagcgg cauccaauac 4020 

cucgcaggau ugucaacacu gccagggaac cccgcggugg cuuccaugau ggcauucagu 4080 

gccgcccuca ccaguccguu gucgaccagu accaccaucc uucucaacau caugggaggc 4140 

ugguuagcgu cccagaucgc accacccgcg ggggccaccg gcuuugucgu caguggccug 4200 

gugggggcug ccgugggcag cauaggccug gguaaggugc ugguggacau ccuggcagga 4260 

uauggugcgg gcauuucggg ggcccucguc gcauucaaga ucaugucugg cgagaagccc 4320 

ucuauggaag augucaucaa ucuacugccu gggauccugu cuccgggagc ccugguggug 4380 

ggggucaucu gcgcggccau ucugcgccgc cacgugggac cgggggaggg cgcgguccaa 4440 

uggaugaaca ggcuuauugc cuuugcuucc agaggaaacc acgucgcccc uacucacuac 4500 

gugacggagu cggaugcguc gcagcgugug acccaacuac uuggcucucu uacuauaacc 4560 

agccuacuca gaagacucca caauuggaua acugaggacu gccccauccc augcuccgga 4620 

uccuggcucc gcgacgugug ggacuggguu ugcaccaucu ugacagacuu caaaaauugg 4680 

cugaccucua aauuguuccc caagcugccc ggccuccccu ucaucucuug ucaaaagggg 4740 

uacaagggug ugugggccgg cacuggcauc augaccacgc gcugcccuug cggcgccaac 4800 

aucucuggca auguccgccu gggcucuaug aggaucacag ggccuaaaac cugcaugaac 4860 

accuggcagg ggaccuuucc uaucaauugc uacacggagg gccagugcgc gccgaaaccc 4920 

cccacgaacu acaagaccgc caucuggagg guggcggccu cggaguacgc ggaggugacg 4980 

cagcaugggu cguacuccua uguaacagga cugaccacug acaaucugaa aauuccuugc 5040 

caacuaccuu cuccagaguu uuucuccugg guggacggug ugcagaucca uagguuugca 5100 

cccacaccaa agccguuuuu ccgggaugag gucucguucu gcguugggcu uaauuccuau 5160 

gcugucgggu cccagcuucc cugugaaccu gagcccgacg cagacguauu gagguccaug 5220 

cuaacagauc cgccccacau cacggcggag acugcggcgc ggcgcuuggc acggggauca 5280 

ccuccaucug aggcgagcuc cucagugagc cagcuaucag caccgucgcu gcgggccacc 5340 

ugcaccaccc acagcaacac cuaugacgug gacauggucg augccaaccu gcucauggag 5400 

ggcggugugg cucagacaga gccugagucc agggugcccg uucuggacuu ucucgagcca 5460 

auggccgagg aagagagcga ccuugagccc ucaauaccau cggagugcau gcuccccagg 5520 

agcggguuuc cacgggccuu accggcuugg gcacggccug acuacaaccc gccgcucgug 5580 
gaaucgugga ggaggccaga uuaccaaccg cccaccguug cugguugugc ucuccccccc 5640
cccaagaagg ccccgacgcc ucccccaagg agacgccgga cagugggucu gagcgagagc 5700 
accauaucag aagcccucca gcaacuggcc aucaagaccu uuggccagcc ccccucgagc 5760 
ggugaugcag gcucguccac gggggcgggc gccgccgaau ccggcggucc gacguccccu 5820 
ggugagccgg cccccucaga gacagguucc gccuccucua ugcccccccu cgagggggag 5880 
ccuggagauc cggaccugga gucugaucag guagagcuuc aaccuccccc ccaggggggg 5940 
gggguagcuc ccgguucggg cucggggucu uggucuacuu gcuccgagga ggacgauacc 6000 
accgugugcu gcuccauguc auacuccugg accggggcuc uaauaacucc cuguagcccc 6060 
gaagaggaaa aguugccaau caacccuuug aguaacucgc uguugcgaua ccauaacaag 6120 
guguacugua caacaucaaa gagcgccuca cagagggcua aaaagguaac uuuugacagg 6180 
acgcaagugc ucgacgccca uuaugacuca gucuuaaagg acaucaagcu agcggcuucc 6240 
aaggucagcg caaggcuccu caccuuggag gaggcgugcc aguugacucc accccauucu 6300 
gcaagaucca aguauggauu cggggccaag gagguccgca gcuuguccgg gagggccguu 6360 
aaccacauca aguccgugug gaaggaccuc cuggaagacc cacaaacacc aauucccaca 6420 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaagaaacca 6480 
gcucgccuca ucguuuaccc ugaccucggc guccgggucu gcgagaaaau ggcccucuau 6540 
gacauuacac aaaagcuucc ucaggcggua augggagcuu ccuauggcuu ccaguacucc 6600 
ccugcccaac ggguggagua ucucuugaaa gcaugggcgg aaaagaagga ccccaugggu 6660 
uuuucguaug auacccgaug cuucgacuca accgucacug agagagacau caggaccgag 6720 
gaguccauau accaggccug cucccugccc gaggaggccc gcacugccau acacucgcug 6780 
acugagagac uuuacguagg agggcccaug uucaacagca agggucaaac cugcgguuac 6840 
agacguugcc gcgccagcgg ggugcuaacc acuagcaugg guaacaccau cacaugcuau 6900 
gugaaagccc uagcggccug caaggcugcg gggauaguug cgcccacaau cucagaaagc 6960 
caggggacug aggaggacga gcggaaccug agagccuuca cggaggccau gaccagguac 7020 
ucugccccuc cuggugaucc ccccagaccg gaauaugacc uggagcuaau aacauccugu 7080 
uccucaaaug ugucuguggc guugggcccg cggggccgcc gcagauacua ccugaccaga 7140 
gacccaacca cuccacucgc ccgggcugcc ugggaaacag uuagacacuc cccuaucaau 7200
ucauggcugg gaaacaucau ccaguaugcu ccaaccauau ggguucgcau gguccuaaug 7260 
acacacuucu ucuccauucu caugguccaa gacacccugg accagaaccu caacuuugag 7320 
auguauggau caguauacuc cgugaauccu uuggaccuuc cagccauaau ugagagguua 7380

cacgggcuug acgccuuuuc uaugcacaca uacucucacc acgaacugac gcggguggcu 7440 

ucagcccuca gaaaacuugg ggcgccaccc cucagggugu ggaagagucg ggcucgcgca 7500 

gucagggcgu cccucaucuc ccguggaggg aaagcggccg uuugcggccg auaucucuuc 7560 

aauugggcgg ugaagaccaa gcucaaacuc acuccauugc cggaggcgcg ccuacuggac 7620 

uuauccaguu gguucaccgu cggcgccggc gggggcgaca uuuuucacag cgugucgcgc 7680 

gcccgacccc gcucauuacu cuucggccua cuccuacuuu ucguaggggu aggccucuuc 7740 

cuacuccccg cucgguagag cggcacacac uagguacacu ccauagcuaa cuguuccuuu 7800 

uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu cuuuuuuuuu uuuuucccuc 7860 

uuucuucccu ucucaucuua uucuacuuuc uuucuuggug gcuccaucuu agcccuaguc 7920 

acggcuagcu gugaaagguc cgugagccgc augacugcag agagugccgu aacuggucuc 7980 

ucugcagauc augu 7994 



<210> 9 
<211> 340 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA
<400> 9

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300
cgcuugcgag ugccccggga ggucucguag accgugcacc 340

<210> 10
<211> 340 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA 
<400> 10 

acccgccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgucua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
ugcuugcgag ugccccggga ggucucguag accgugcacc 340 



<210> 11 

<211> 236 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA 



<400> 11 

agcggcacac acuagguaca cuccauagcu aacuguuccu uuuuuuuuuu uuuuuuuuuu 60
uuuuuuuuuu uuuuuuuuuu uucuuuuuuu uuuuuuuccc ucuuucuucc cuucucaucu 120
uauucuacuu ucuuucuugg uggcuccauc uuagcccuag ucacggcuag cugugaaagg 180
uccgugagcc gcaugacugc agagagugcc guaacugguc ucucugcaga ucaugu 236



<210> 12 
<211> 232 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA 
<400> 12 

agcggcacac auuagcuaca cuccauagcu aacuguuccu uuuuuuuuuu uuuuuuuuuu 60 
uuuuuuuuuu uuuuuuucuu uuuuuuuuuu uuucccucuu ucuucccuuc ucaucuuauu 120 
cuacuuucuu ucuugguggc uccaucuuag cccuggucac ggcuagcugu gaaagguccg 180 
ugagccgcau gacugcagag agugccguaa cuggucucuc ugcagaucau gu 232 



<210> 13 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 13 
cgggagagcc atagtgg 17



<210> 14 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 14 

agtaccacaa ggcctttcg 19 



<210> 15 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 15 

ctgcggaacc ggtgagtaca c 21 

<210> 16 
<211> 20 
<212> DNA 
<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 16 

aacaagatgg attgcacgca 20 

<210> 17 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 17 

cgtcaagaag gcgatagaag 20 

<210> 18 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 18

gcactctctg cagtcatgcg gctcacggac 30 



<210> 19 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 19 

cccctgtgag gaactactgt cttcacgc 28 



<210> 20 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 20 

ccgggagagc catagtggtc tgcg 24 



<210> 21 
<211> 30
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 21 

ccactcaaag aaaaagtgtg acgagctcgc 30 



<210> 22 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 22 

ggcttgggca cggcctga 18 



<210> 23 

<211> 30 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> Description of Artificial Sequence: synthetic DNA
<400> 23 

gcggtgaaga ccaagctcaa actcactcca 30 

<210> 24 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 24 

agaacctgcg tgcaatccat c 21 

<210> 25 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 25 

cccgtcatga gggcgtcggt ggc 23 
<210> 26

<211> 27 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 26 

accagcaacg gtgggcggtt ggtaatc 27 



<210> 27 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 27 

ggcacgcgac acgctgtg 18 



<210> 28 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: synthetic DNA 



<400> 28 

agctagccgt gactagggct aagatggagc 30 

<210> 29 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA (primer)

<400> 29 

aacaagatgg attgcacgca 20 

<210> 30 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA (primer)

<400> 30

cgtcaagaag gcgatagaag 20



<210> 31 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA
<400> 31 

gcactctctg cagtcatgcg gctcacggac 30 

<210> 32 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA
<400> 32 

cccctgtgag gaactactgt cttcacgc 28 
<210> 33

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA

<400> 33 

ccgggagagc catagtggtc tgcg 24 



<210> 34 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA

<400> 34 

ccactcaaag aaaaagtgtg acgagctcgc 30 



<210> 35 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: synthetic DNA (primer)

<400> 35 

ggcttgggca cggcctga 18 

<210> 36 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA
<400> 36 

gcggtgaaga ccaagctcaa actcactcca 30 

<210> 37 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA

<400> 37

agaacctgcg tgcaatccat c 21



<210> 38 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA
<400> 38 

cccgtcatga gggcgtcggt ggc 23 



<210> 39 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA
<400> 39 

accagcaacg gtgggcggtt ggtaatc 27 



<210> 40 
<211> 18 
<212> DNA

<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: synthetic DNA
<400> 40 

ggaacgcgac acgctgtg 18 



<210> 41 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA

<400> 41 

agctagccgt gactagggct aagatggagc 30 